Isolated polypeptide of the toxin A and toxin B proteins of C. difficile and uses thereof

ABSTRACT

This present invention provides C-TAB.G5 and C-TAB.G5.1 isolated polypeptides comprising the receptor binding domains of  C. difficile  toxin A and toxin B as set forth in the amino acid sequences of SEQ ID NO: 2 and SEQ ID NO: 4. The C-TAB.G5 and C-TAB.G5.1 isolated polypeptides may be used to neutralize toxic effects of  C. difficile  toxin A and/or toxin B.

RELATED APPLICATIONS

This application is a division of U.S. application Ser. No. 14/342,565, filed Oct. 28, 2014, which is a national stage filing under 35 U.S.C. § 371 of international application PCT/EP2011/065304, filed Sep. 5, 2011, which was published under PCT Article 21(2) in English, the disclosure of each of which is incorporated by reference herein in its entirety.

FIELD OF THE INVENTION

The present invention relates to an isolated polypeptide containing the receptor binding domains of the Clostridium difficile toxin A and toxin B and its use as a vaccine. This isolated polypeptide provides anti-toxin immunity to both toxins.

BACKGROUND OF THE INVENTION

Clostridium difficile is the leading cause of nosocomial antibiotic associated diarrhea and has become a major health problem in hospitals, nursing home and other care facilities. The cost to hospitals has been estimated to be 2 billion dollars in Europe and 3.2 billion dollars in the United States.

The causative agent is a gram positive, spore forming anaerobic bacterium, commonly found through out the environment but also present in the intestinal tract of 2-3% of the healthy adult population. C. difficile associated disease (CDAD) is induced by the disruption of the normal colonic flora, usually the result of the administration of antibiotics. Following exposure to C. difficile spores in the environment, the organism may colonize the intestinal mucosa where the production of disease causing toxins can result in CDAD. Disease may range from mild uncomplicated diarrhea to severe pseudomembranous colitis and toxic megacolon.

CDAD has become increasingly more problematic in health care settings. A recent study reported that 31% of hospital patients who receive antibiotics become colonized with C. difficile and 56% of those patients who become colonized go on to develop CDAD. Overall, C. difficile is responsible for 10-25% of all antibiotic associated diarrheas, 50-75% of antibiotic related colitis and 90-100% of antibiotic related pseudomembranous colitis. Treatment of CDAD involves discontinuation of the causal antibiotic followed by treatment with either metronidazole or vancomycin. Relapsing after antibiotic treatment is discontinued occurs in approximately 20% of patients, often the result of recolonization by C. difficile.

In 2003, a C. difficile outbreak in Quebec, Canada indicated the emergence of a more virulent strain of C. difficile known as North American Phenotype 1/027 (NAP1). NAP1 has been associated with greater virulence, poor outcomes and greater morbidity and mortality rates compared to previous strains. The emergence of this strain adds to the problems already encountered in trying to contain the incidence of CDAD.

Fidaxomicin (Dificid©) for prevention of recurrent disease is the first in a new class of narrow spectrum macrocyclic antibiotic drugs (Revill, P.; Serradell, N.; Bolos, J. (2006). “Tiacumicin B: macrolide antibiotic treatment of C. difficile-associated diarrhea”. Drugs of the Future 31 (6): 494-497). It is a fermentation product obtained from the actinomycete Dactylosporangium aurantiacum subspecies hamdenesis. Fidaxomicin is non-systemic, meaning it is minimally absorbed into the bloodstream, it is bactericidal, and it has demonstrated selective eradication of pathogenic Clostridium difficile with minimal disruption to the multiple species of bacteria that make up the normal, healthy intestinal flora. The maintenance of normal physiological conditions in the colon can reduce the probability of Clostridium difficile infection recurrence (Johnson, Stuart (2009-06). “Recurrent Clostridium difficile infection: a review of risk factors, treatments, and outcomes”. Journal of Infection 58 (6): 403-410). Although it is thought, that the introduction of this new class of antibiotic drug will improve the treatment of CDAD, there is still a medical need for a preventative drug, in particular for high risk patients such as the elderly and the immunocompromised patients.

CDAD is the result of the actions of two exotoxins produced by C. difficile, toxin A and toxin B (also referred to as CTA and CTB, respectively). Both toxins are high molecular weight (˜300 kDa) secreted proteins that possess multiple functional domains (Voth D E and Ballard J D, Clinical Microbiology Reviews 18:247-263 (2005)). The N-terminal domain of both toxins contains ADP-glucosyltransferase activity that modifies Rho-like GTPases. This modification causes a loss of actin polymerization and cytoskeletal changes resulting in the disruption of the colonic epithelial tight junctions. This leads to excessive fluid exudation into the colon and a resulting diarrhea. The central domain contains a hydrophobic domain and is predicted to be involved in membrane transport. The C-terminal domain of both toxins contain multiple homologous regions called repeating units (RUs) that are involved in toxin binding to target cells (Ho et al, Howell 102:18373-18378 (2005)). The repeating units are classified as either short (21-30 amino acids) or long (˜50 amino acids). Repeating units combine to form clusters, each usually containing one long and 3-5 short repeating units. The full length toxin A possesses 39 repeating units (ARUs) organized into 8 clusters (Dove et al. Infect. Immun. 58:480-488 (1990), while the full length toxin B contains 24 repeating units (BRUs) organized into 5 clusters (Barroso et al., Nucleic Acids Res. 18:4004 (1990); Eichel-Streiber et al., Gene 96:107-113 (1992)).

A number of studies, from both animal models and from the clinic, have indicated a role for anti-toxin antibody in the protection from C. difficile associated disease. Hamsters immunized with formalin inactivated toxin A and toxin B generated high levels of anti-toxin antibody and were protected from a lethal challenge of C. difficile bacteria (Giannasca P J and Warny M, Vaccine 22:848-856 (2004)). In addition, passive transfer of mouse anti-toxin antibody protected hamsters in a dose dependent manner. Kyne L et al. (The Lancet 357:189-193 (2001)) reported that the development of an anti-toxin A antibody response during an initial episode of CDAD correlated with protection against disease recurrence.

The determinants recognized by protective anti-toxin antibodies have been localized to the C-terminal domain containing the reating units which function as the receptor binding domain. Initially, Lyerly et al. (Current Microbiology 21:29-32 (1990)) revealed that the toxin A C-terminal domain containing 33 repeating units is capable of inducing the production of neutralizing anti-toxin antibody and may protect from C. difficile infection. In this study hamsters were injected subcutaneously with the purified recombinant polypeptide multiple times prior to challenge with the bacteria, however only partial protection was achieved. Another study (Ryan et al., Infect. Immun. 65:2941-49 (1997)) showed that the isolated polypeptide containing 720 amino acid residues from the C-terminus of CTA and the secretion signal of E. coli hemolysin A (expressed in Vibrio cholerae) induced protective systemic and mucosal immunity against a small dose of CTA in the rabbit CDAD model.

It was also reported that antibody response against the C-terminal domain of both toxin A and B was necessary to achieve full protection (Kink and Williams, Infect. Immun. 66:2018-25 (1998), U.S. Pat. No. 5,736,139 (1998)). This study revealed that the C-terminal domain of each toxin was most effective in generating toxin-neutralizing antibodies. It demonstrated the effectiveness of orally deliverd avian antibodies (antitoxin) raised against C-terminal domain of CTA and CTB in the hamster lethal model. The results also indicate that the antitoxin may be effective in the treatment and management of CDAD in humans. In another study, human anti-toxin A and B monoclonal antibodies were reported confer protection against C. difficile induced mortality in hamsters (Babcock et al., Infect. Immun. 74:6339-6347 (2006)). Protection was only observed by antibodies directed against the receptor binding domain of either toxin and enhanced protection was observed following treatment with both anti-toxin A and B antibodies.

On the other hand, Ward et al. (Infect. Immun. 67: 5124-32 (1999)) considered 14 repeating units from C. difficile toxin A (14 CTA) for the study of adjuvant activity. The repeating units were cloned and expressed either with the N-terminal polyhistidine tag (14 CTA-HIS) or fused to the nontoxic binding domain from tetanus toxin (14 CTA-TETC). Both fusion proteins administered intranasally generated anti-toxin A serum antibodies but no response at the mucosal surface in mice. Enhanced systemic and mucosal anti-toxin A responses were seen following co-administration with E. coli heat-labile toxin (LT) or its mutated form LTR72. Based on the data, Ward et al. suggested using non-toxic 14 CTA-TETC fusion as a mucosal adjuvant in human vaccine directed against clostridial pathogens.

Recent biochemical studies on the repeating unit domains of C. difficile toxins has looked at the minimal sequence requirements for forming stable tertiary structure (Demarest S J et al., J. Mol. Bio. 346:1197-1206 (2005)). An 11 repeating unit peptide derived from toxin A was found with a correct tertiary structure but 6 and 7 repeating units from toxins A and B did not. The correctly folded 11 repeating unit segment was found to maintain the receptor binding property. A second study examined the functional properties of toxin A fragments containing 6, 11 or 15 repeating units (Dingle T, Glycobiology 18:698-706 (2008)). Only the 11 and 15 repeat units were capable of competitively inhibiting the toxin neutralizing ability of anti-toxin A antibody. While all 3 fragments were found to have hemagglutinating activity, the longer fragments displayed higher hemagglutinating activity than the shorter ones. The data indicates that toxin receptor binding domain structure and immunogenicity are retained in domain fragments that contain greater than 11-14 repeats.

Thomas et al. (WO97/02836, U.S. Pat. No. 5,919,463 (1999)) also disclosed C. difficile toxin A, toxin B and certain fragments thereof (e.g., C-terminal domain containing some or all of the repeating units) as mucosal adjuvants. They showed that intranasal administration of CTA or CTB significantly enhanced mucosal immune response to a heterologous antigen such as Helicobacter pylori urease, ovalbumin, or keyhole limpet hemocyanin (KLH) in multiple mouse compartments and was associated with protection against the challenge with Helicobacter. Additionally, the adjuvant activity of a toxin A fusion protein was evaluated: 794 C-terminal amino acid residues of CTA comprising ARUs (toxin A repeating units) were fused to glutatione-S-transferase (GST) and resulted polypeptide GST-ARU was expressed in E. coli. This study demonstrated significant enhancement of immune response by GST-ARU to co-administered antigens in serum and mucosal secretions.

All of these studies suggest potential use of a non-toxic, recombinant protein comprising either C. difficile toxin A, or toxin B, or fragments thereof, or their combinations for producing an active vaccine against CDAD. Currently, no vaccine against C. difficile is commercially available, although a candidate vaccine consisting of formalin-detoxified entire toxins A and B has been evaluated in human phase I and IIa studies. It is reported that parenteral immunization with this vaccine induces anti-toxin IgG and toxin-neutralizing antibody responses (Kotloff K L et al., Infect. Immun. 69:988-995 (2001); Aboudola S et al., Infect. Immun. 71:1608-1610 (2003)).

The literature further indicates that the construction of a recombinant fusion protein containing both toxin A and B receptor binding domains of C. difficile, either in their entirety or fragments thereof, would be an efficient and commercially viable approach for vaccine development. Such an approach has been attempted as a two part fusion protein of a 700 base pair fragment of toxin A and a 1300 base pair fragment of toxin B by Varfolomeeva et al. (Mol. Genetics, Microb. and Virol. 3:6-10 (2003)). This approach has also been described by Belyi and Varfolomeeva (FEMS Letters 225:325-9 (2003)) demonstrating construction of the recombinant fusion protein consisting of three parts: two C-terminal domains composed of repeating units of C. difficile toxin A and toxin B followed by the fragment of Clostridium perfringens enterotoxin Cpe. The fusion protein was expressed in E. coli but the product was accumulated in inclusion bodies and was not stable. Moreover, the yield of pure product achieved in this study (50 μg per 100 ml culture) was considerably low.

Wilkins et al. (WO 00/61762, U.S. Pat. No. 6,733,760 (2004)) also described the use of recombinant C. difficile toxin A and B repeating units (recombinant ARU and recombinant BRU) and their polysaccharide conjugates for the preparation of a vaccine against CDAD. The resulting recombinant ARU protein comprised 867 amino acid residues while the recombinant BRU protein contains 622 amino acids in length. Unlike the previously mentioned studies, this work demonstrated high-level expression of recombinant ARU and BRU soluble proteins in E. coli. Mice vaccinated with recombinant ARU and with polysaccharide-conjugated recombinant ARU both mounted a high level of neutralizing anti-toxin A antibodies and were highly protected against lethal challenge with C. difficile toxin A. In addition, Wilkins et al. suggested using a recombinant fusion protein consisting of both ARU and BRU for the preparation of a vaccine.

There is an interest in developing a vaccine against CDAD. A recombinant fusion protein consisting of ARU and BRU may be potentially useful as a vaccine.

SUMMARY OF THE INVENTION

The present invention provides new tools and methods for the design, production and use of the toxin A and toxin B from C. difficile. The present invention provides an isolated polypeptide C-TAB comprising SEQ ID NO: 2 (C-TAB.G5) or a derivative thereof, SEQ ID NO: 4 (C-TAB.G5.1). The C-TAB.G5 or C-TAB.G5.1 comprises 19 repeating units of the C-terminal domain of toxin A fused to 23 repeating units of the C-terminal domain of toxin B. The present invention also includes compositions and formulations comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The compositions or formulations may contain the isolated polypeptide, an additional antigen, an adjuvant, and/or an excipient. Alternatively, the compositions or formulations may consist essentially of the isolated polypeptide without an adjuvant or other active ingredients (but optionally comprising an excipient such as a carrier, buffer and/or stabilizer). Moreover, the compositions or formulations of the invention may be administered concomitantly with other drugs such as an antibiotic in particular e.g. in subjects with recurrent CDAD or in subjects requiring frequent and/or prolonged antibiotic use.

The present invention also provides a vaccine comprising the isolated polypeptide of the present invention. The vaccine may further comprise an adjuvant, such as such as alum, an adjuvant derived from an ADP-ribosylating exotoxin or others. The vaccine may be administered in a one dose regimen, two dose regimen (administered e.g. within 3 to 20 days, e.g. after 10 to 15 days of the first dose), three dose regimen (administered e.g. after about 7 days and about 21 days of the first dose), or more than three dose regimen, preferably a two or three dose regimen, wherein the dose comprises a 20 μg to 200 μg amount of the polypeptide of the invention.

The present invention provides a method of preventing, treating, or alleviating one or more symptoms of a disease, such as CDAD by administering the isolated polypeptide of the invention to a subject in need thereof. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be administered to the subject intramuscularly or by other routes of delivery.

In one embodiment, the present invention provides a method of preventing a disease, such as CDAD by administering the isolated polypeptide of the inventions or a composition comprising said polypeptide to a subject at risk of CDAD, such as e.g. a subject with the following profile: i) a subject with a weaker immune system such as e.g. an elderly subject (e.g. a subject above 65 years of age) or a subject below 2 years of age; ii) an immunocompromised subject such as e.g. a subject with AIDS; iii) a subject taking or planning to take immunosuppressing drugs; iv) a subject with planned hospitalization or a subject that is in hospital; v) a subject in or expected to go to an intensive care unit (ICU); vi) a subject that is undergoing or is planning to undergo gastrointestinal surgery; vii) a subject that is in or planning to go to a long-term care such as a nursing home; viii) a subject with co-morbidities requiring frequent and/or prolonged antibiotic use; ix) a subject that is a subject with two or more of the above mentioned profiles, such as e.g. an elderly subject that is planning to undergo a gastrointestinal surgery; x) a subject with inflammatory bowel disease; and/or xi) a subject with recurrent CDAD such as e.g. a subject having experienced one or more episodes of CDAD.

In one embodiment, the invention provides methods of producing the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be produced from a nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide using a bacterial expression system, such as an E. coli expression system.

In one embodiment the present invention provides the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide wherein the 19 repeating units of toxin A are connected to the 23 repeating units of toxin B via a linker consisting of at least 4, 5, 6, 7, 8, 9, or 10 amino acid residues. By way of example, the linker of the present invention may comprise the sequence RSMH (Arg-Ser-Met-His) (amino acids 439-442 of SEQ ID NO: 2 or SEQ ID NO: 4).

In another embodiment the invention provides a variant of the isolated polypeptide that comprises at least one mutation (e.g., insertion, substitution or deletion), for example in the ARU and/or BRU. The sequence of the variant may have 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 2.

This invention also provides methods for producing the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide or variants thereof through recombinant DNA engineering, bacterial fermentation and protein purification. In one embodiment, the present invention provides methods for constructing the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. In another embodiment, the invention provides methods of producing the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide using a bacterial expression system, such as an E. coli expression system.

The invention further provides methods for preventing and treating CDAD in subjects in need thereof, such as humans. In this method the C-TAB.G5 or C-TAB.G5.1 is administered to a subject either alone or co-administered with one or more adjuvants such as alum or others. Subjects may be healthy individuals who are at risk for exposure to C. difficile, human subjects who have been treated and recovered from C. difficile infection and who are at risk for re-infection by C. difficile, or human subjects who are currently infected with C. difficile and whose condition may be improved by induction of C. difficile toxin-neutralizing antibody.

The present invention provides an immunogenic composition comprising C-TAB.G5 or C-TAB.G5.1. The immunogenic composition may further include an adjuvant to enhance an antigen-specific immune response and/or a pharmaceutically acceptable carrier and/or other components in a formulation suitable for application to a subject in need thereof. The immunogenic composition may be delivered by intramuscular (IM) delivery, intradermal (ID) delivery, subcutaneous (SC) delivery, intraperitoneal (IP) delivery, oral delivery, nasal delivery, buccal delivery, or rectal delivery.

In another embodiment of the invention the immunogenic composition elicits antibodies that bind native C. difficile toxins and neutralize their cytotoxic activity thus providing long-term, active protection, and/or treatment against C. difficile associated disease (CDAD).

Accordingly, the invention provides immunogenic compositions useful for the prevention or treatment of C. difficile associated disease in subjects in need thereof.

In another embodiment, the invention provides nucleic acids and fragments or variants thereof that encode C-TAB.G5 or C-TAB.G5.1. The invention also provides expression vectors comprising the nucleic acid encoding C-TAB.G5 or C-TAB.G5.1.

Another embodiment of the present invention provides antibodies and fragments thereof, such as neutralizing, humanized, monoclonal, chimeric and polyclonal antibodies, specific for C-TAB.G5 or C-TAB.G5.1. The antibodies or fragments thereof may recognize toxin A and/or toxin B.

Another embodiment provides a vaccine comprising a polypeptide having the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4.

Another embodiment of this invention provides diagnostic kits comprising the nucleic acids, polypeptides and/or antibodies of the invention.

Other embodiments and advantages of the invention are set forth in part in the description, which follows, and in part, may be obvious from this description, or may be learned from the practice of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A shows the nucleic acid encoding the C-TAB.G5 isolated polypeptide (SEQ ID NO: 1). FIG. 1B shows the amino acid sequence of the C-TAB.G5 isolated polypeptide (SEQ ID NO: 2). The amino acid linker between the toxin A domain and the toxin B domain is underlined.

FIG. 2A shows the nucleic acid encoding the C-TAB.G5.1 isolated polypeptide (SEQ ID NO: 3). FIG. 2B shows the amino acid sequence of the C-TAB.G5.1 isolated polypeptide (SEQ ID NO: 4). The amino acid linker between the toxin A domain and the toxin B domain is underlined.

FIG. 3 shows the enhancement of antibody production in C-TAB.G5 vaccinated mice by increasing doses of C-TAB.G5 and co-delivery with alum adjuvant. Mice received two vaccinations by IM injection. IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies were evaluated by ELISA two weeks after the first and second injection.

FIG. 4 shows a graphical representation of anti-C-TAB, anti-toxin A, and anti-toxin B IgG induction in mice receiving increasing doses of C-TAB.G5 with and without alum by two IM injection.

FIG. 5 shows antibody titers over one log dose range in mice immunized with C-TAB.G5 in the presence or absence of alum. IgG titers were evaluated by ELISA two weeks after the second immunization. The data demonstrate that alum significantly augments antibody production in vaccinated mice.

FIG. 6 shows protective effect in mice vaccinated with C-TAB.G5 (with and without alum) and then exposed to a lethal dose of toxin A or toxin B. Mice receiving two vaccinations (IM) in two week interval were challenged (IP) three weeks later. Toxin A and toxin B neutralizing antibodies (TNA) were assessed two weeks after the second injection, and the percent of animals survived the lethal challenge was determined. Increased doses of C-TAB.G5 conferred greater TNA production, as well as increased protection to the lethal challenge. The presence of alum further increased TNA production, as well as conferring higher survival at lower doses.

FIG. 7 shows a comparison of antibody response and protection efficacy of C-TAB.G5 in vaccinated young (6-7 weeks) and old (18 months) mice. Mice receiving two vaccinations (IM) in two week interval were challenged (IP) three weeks later. ELISA IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies, TNA production as well as overall survival were assessed. Young mice demonstrated higher antibody response even without alum, and both groups showed improved survival when vaccinated in the presence of alum.

FIG. 8 shows a comparison of the kinetics of anti-C-TAB IgG antibody development in vaccinated young and old mice. Young mice demonstrated greater rates and earlier IgG production, and both groups demonstrated improved responses when vaccinated in the presence of alum.

FIG. 9 shows a comparison in anti-C-TAB, anti-toxin A and anti-toxin B antibody production in mice immunized with either C-TAB.G5.1 or toxoid A and B mixture (1:1). Mice received two vaccinations IM injection. IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies were evaluated by ELISA two weeks after the second injection. Immunization with toxoid induces antibody to the N-terminal portion of the toxin molecule while immunization with C-TAB induces antibody to the C-terminal portion of the toxin molecule.

FIG. 10 shows a comparison in TNA production and protection against challenge with toxin A or B in mice immunized with either C-TAB.G5.1 or toxoid A and B mixture. Mice receiving two vaccinations (IM) in two week interval were challenged (IP) three weeks later with a lethal dose of toxin A or toxin B.

FIGS. 11A-11C show anti-C-TAB (FIG. 11A), anti-toxin A (FIG. 11B), and anti-toxin B (FIG. 11C) IgG production in hamsters immunized with C-TAB.G5.1 with and without alum. Hamsters received three vaccinations by IM injection on day 0 and day 14. IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies were evaluated by ELISA on days 14, 28 and 35.

FIG. 12 shows a graphical representation of anti-C-TAB IgG antibody development in hamsters immunized with C-TAB.G5.1 with or without alum.

FIG. 13 shows a comparison in TNA and protection in hamsters immunized with C-TAB.G5.1 with or without alum. Two weeks after the third vaccination hamsters received a lethal dose of toxin A or toxin B by IP injection.

FIG. 14 shows survival of hamsters vaccinated with C-TAB.G5.1 following the intragastric administration of a lethal dose of C. difficile spores. Survival data was plotted as Kaplan-Meier survival fit curves and statistical analysis was done using a log rank analysis. At all spore doses (10², 10³ and 10⁴), 100% survival of hamsters in the vaccinated group was observed and survival was significantly enhanced when compared to the placebo group.

FIG. 15 shows anti-C-TAB, anti-toxin A, and anti-toxin B antibody production in cyanomologous monkeys immunized with C-TAB.G5.1 in the presence or absence of alum. Two groups of monkeys (three per group, 4-6 years) received 200 μg of C-TAB.G5.1 with or without 250 μg alum. Blood samples were taken on study days 0, 14, 28 and 42. ELISA method was used to assess anti-C-TAB, anti-toxin A and anti-toxin B IgG titers.

FIG. 16 shows a comparison of immunogenicity of C-TAB.G5 and C-TAB.G5.1 delivered over a 1 μg-30 μg dose range either in PBS or histidine buffer. Mice received two vaccinations (IM) in two week interval. IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies were evaluated by ELISA two weeks after the second injection. All three antibody titers were not significantly different (T-test analysis) between C-TAB.G5 delivered in PBS or histidine buffer and C-TAB.G5.1 delivered in histidine buffer.

FIG. 17 shows a comparison of immunogenicity of C-TAB.G5, C-TABNCTB and C-TADCTB in mice. Mice received two vaccinations of each recombinant protein in two week interval by IM injection. All immunizations were done in the absence of alum adjuvant. IgG titers for anti-C-TAB, anti-toxin A and anti-toxin B antibodies were evaluated by ELISA two weeks after the second injection. All three fusion proteins demonstrate high immunogenicity.

FIG. 18 shows protection against challenge with native toxin B in mice. Mice were immunized as indicated for FIG. 17 and three weeks later they were challenged by IP injection with a lethal dose of native toxin B.

FIG. 19 shows a comparison in TNA and protection in hamsters vaccinated with either C-TAB.G5.1 or C-TADCTB in the absence or presence of alum. Two weeks after the third vaccination hamsters received a lethal dose of toxin A or toxin B by IP injection.

FIGS. 20A and 20B show TNA production and protection against challenge with toxin A or toxin B in mice immunized with C-TAB.G5.1 in different regimens. Comparison in TNA production and protection between groups of mice vaccinated by IM injection three times on day 0, 3 and 14, or on day 0, 7 and 21, or on day 0, 14 and 28. Three weeks after the last injection mice were challenged with a lethal dose of toxin A or toxin B (FIG. 20A is in table form and FIG. 20B is in graph form).

FIG. 21 shows protection (survival) against challenge with C. difficile toxin A (55 ng/mouse) in mice immunized with a single shot of 10 μg C-TAB.G5.1 and 12.5 μg alum (in 100 μl). Said challenge was done 21 days, 35 days or 49 days after immunization.

DETAILED DESCRIPTION

General Description

The present invention provides an immunogenic composition for inducing protective and/or therapeutic immune responses to C. difficile toxins A and B comprising use of a isolated polypeptide C-TAB.G5 (SEQ ID NO: 2) or a derivative thereof, C-TAB.G5.1 (SEQ ID NO: 4). that comprises 19 repeating units (RU) of toxin A and 23 repeating units (RU) of toxin B or peptide fragments, or variants thereof.

The present invention also provides methods of producing the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and the method of preparing the composition (e.g. a vaccine) useful for prevention and/or treatment of CDAD in mammals. The following description provides more details and examples for the construction, expression, and purification of the recombinant isolated polypeptides, their use as antigens for inducing a specific-immune response as well as evaluating protection in subjects. The subjects may be animals or humans.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides for use in the methods and compositions of the present invention may be prepared using any of several standard methods. For example, the C-TAB.G5 or C-TAB.G5.1 may be produced using standard recombinant DNA techniques, wherein a suitable host cell is transformed with an appropriate expression vector containing a part of a toxin-encoding nucleic acid fragment (see e.g. Dove et al., Infect. Immun. 58:480-8 (1990), and Barroso et al., Nucleic Acids Research 18:4004 (1990). Any of a wide variety of expression systems may be used to produce the recombinant polypeptides. C-TAB.G5 or C-TAB.G5.1 may be produced in a prokaryotic host (e.g. a bacterium, such as E. coli or Bacillus) or in an eukaryotic host (e.g. yeast cells, mammalian cells (e.g. COS 1, NIH3T3, or JEG3 cells), or insect cells (e.g. Spodoptera frugiperda (SF9) cells)). Such cells are available, for example, from the American Type Culture Collection (ATCC). The method of transformation and transfection and the choice of expression vector will depend on the host system selected. Transformation and transfection methods are described by, e.g., Ausubel et al., ISBN: 047132938X C-TAB.G5 or C-TAB.G5.1, particularly short fragments, may also be produced by chemical synthesis, e.g., by the methods described in Solid Phase Peptide Synthesis, 1984, 2nd ed., Stewart and Young, Eds., Pierce Chemical Co., Rockford, Ill., or by standard in vitro translation methods.

In addition to the C-TAB.G5 or C-TAB.G5.1 sequences, the present invention provides variants thereof that are functionally active and immunogenic. The variants may have the same level of immunogenicity as C-TAB.G5 or C-TAB.G5.1. The variant may have amino acid substitutions, deletions, or insertions as compared to SEQ ID NO: 2 or SEQ ID NO: 4. Genes encoding C-TAB.G5 or C-TAB.G5.1 or variants thereof may be made using standard methods (see below; also see, e.g. Ausubel et al., supra).

In addition to the C-TAB.G5 or C-TAB.G5.1 sequences, the present invention provides further derivatives of C-TAB.G5 that comprise additional repeats. By way of example, a fusion protein, C-TABNCTB (SEQ ID NO: 18, encoded by SEQ ID NO: 17), comprises, like C-TAB.G5, 19 repeating units of CTA (amino acids 2272-2710), 23 repeating units of CTB (amino acids 1850-2366), and a further additional 10 repeats of CTB (amino acids 1834-2057) fused to the C-terminus of CTB. A further variant, C-TADCTB fusion protein (SEQ ID NO: 20, encoded by SEQ ID NO:19) comprises C-TAB.G5 (19 repeats of CTA and 23 repeats of CTB) plus an additional 24 repeating units of CTB (amino acids 1834-2366) fused to the C-terminus of C-TAB.G5. A variant may also comprise additional copies of C-TAB.G5 or portions thereof. For example, C-TADCTB comprises a double portion of the repeating units of CTB present in C-TAB.G5.

The present invention provides methods for high level expression C-TAB.G5 or C-TAB.G5.1 in bacterial system such as E. coli comprising introducing a nucleic acid encoding C-TAB.G5 or C-TAB.G5.1 into a bacterial host cell and expressing C-TAB.G5 or C-TAB.G5.1.

In addition, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention may be covalently coupled or cross-linked to adjuvants (see, e.g., Cryz et al., Vaccine 13:67-71(1994); Liang et al., J. Immunology 141:1495-501 (1988) and Czerkinsky et al., Infect. Immun. 57:1072-77 (1989)).

The present invention provides a vaccine comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide that can protect and provide therapy against CDAD. The vaccine of the present invention comprises a novel antigen which can be delivered intramuscularly (IM), intradermally (ID), subcutaneously (SC), orally, nasally, buccally, or rectally routes. The vaccine may provide immune protection or induce antibodies for passive immunization.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention provides a vaccine to immunize against CDAD. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention or variants thereof, is a combined vaccine candidate targeted to broaden the protective coverage against C. difficile associated diseases, such as CDAD, to a level not known or published hitherto. This concept of a single vaccine offering protection or a diminished severity of C. difficile associated diseases represents a unique step forward in managing public health at a global level and especially reducing the severity of epidemics (e.g. nursing homes, cruise ships).

As used herein, “toxin A protein” or “toxin B protein” refers to toxic proteins of C. difficile that are primarily responsible for CDAD. Toxin A and toxin B comprise multiple repeating units responsible for immunogenicity in the C-terminal binding domains.

As used herein “wild-type” or “native” refers to a full length protein comprised of a nucleic acid or amino acid sequence as would be found endogenously in a host cell.

As used herein, the terms “Clostridium difficile associated disease”, “Clostridium difficile related disease”, “Clostridium difficile-associated disease”, “Clostridium difficile toxin-mediated disease”, “Clostridium difficile infection”, and “CDAD” refer to diseases caused, directly or indirectly, by infection with Clostridium difficile.

“Antigen” refers to a substance that induces a specific immune response when presented to immune cells of an organism. For example, an antigen may be a nucleic acid, a protein, a polypeptide, a peptide, a glycoprotein, a carbohydrate, a lipid, a glycolipid, a lipoprotein, a fusion protein, a phospholipid, or a conjugate of a combination thereof. An antigen may comprise a single immunogenic epitope, or a multiplicity of immunogenic epitopes recognized by a B-cell receptor (i.e., antibody on the membrane of the B cell) or a T-cell receptor. Antigen may be provided as a virus-like-particle (VLP) or a whole microbe or microorganism such as, for example, a bacterium or virion. The antigen may be an inactivated or attenuated live virus. The antigen may be obtained from an extract or lysate, either from whole cells or membrane alone; or antigen may be chemically synthesized or produced by recombinant means. An antigen may be administered by itself or with an adjuvant. A single antigen molecule may have both antigen and adjuvant properties.

By “adjuvant” is meant any substance that is used to specifically or non-specifically potentiate an antigen-specific immune response, perhaps through activation of antigen presenting cells. Examples of adjuvants include an oil emulsion (e.g., complete or incomplete Freund's adjuvant), Montanide incomplete Seppic adjuvant such as ISA, oil in water emulsion adjuvants such as the Ribi adjuvant system, syntax adjuvant formulation containing muramyl dipeptide, aluminum salt adjuvant (ALUM), polycationic polymer, especially polycationic peptide, especially polyarginine or a peptide containing at least two LysLeuLys motifs, especially KLKLLLLLKLK (SEQ ID NO: 21), immunostimulatory oligodeoxynucleotide (ODN) containing non-methylated cytosine-guanine dinucleotides (CpG) in a defined base context (e.g., as described in WO 96/02555) or ODNs based on inosine and cytidine (e.g., as described in WO 01/93903), or deoxynucleic acid containing deoxy-inosine and/or deoxyuridine residues (as described in WO 01/93905 and WO 02/095027), especially Oligo(dIdC)₁₃ (as described in WO 01/93903 and WO 01/93905), neuroactive compound, especially human growth hormone (described in WO 01/24822), or combinations thereof, a chemokine (e.g., defensins 1 or 2, RANTES, MIP1-α, MIP-2, interleukin-8, or a cytokine (e.g., interleukin-1β, -2, -6, -10 or -12; interferon-γ; tumor necrosis factor-α; or granulocyte-monocyte-colony stimulating factor) (reviewed in Nohria and Rubin, 1994), a muramyl dipeptide variant (e.g., murabutide, threonyl-MDP or muramyl tripeptide), synthetic variants of MDP, a heat shock protein or a variant, a variant of Leishmania major LeIF (Skeiky et al., 1995, J. Exp. Med. 181: 1527-1537), non-toxic variants of bacterial ADP-ribosylating exotoxins (bAREs) including variants at the trypsin cleavage site (Dickenson and Clements, (1995) Infection and Immunity 63 (5): 1617-1623) and/or affecting ADP-ribosylation (Douce et al., 1997) or chemically detoxified bAREs (toxoids), QS21, Quill A, N-acetylmuramyl-L-alanyl-D-isoglutamyl-L-alanine-2-[1,2-dipalmitoyl-s-glycero-3-(hydroxyphosphoryloxy)]ethylamide (MTP-PE) and compositions containing a metabolizable oil and an emulsifying agent. An adjuvant may be administered with an antigen or may be administered by itself, either by the same route as that of the antigen or by a different route than that of the antigen. A single adjuvant molecule may have both adjuvant and antigen properties.

By “effective amount” is meant an amount of a therapeutic agent sufficient to induce or enhance an antigen-specific immune response, for an antigen, or treat or diagnose a condition, for a drug. Such induction of an immune response may provide a treatment such as, for example, immunoprotection, desensitization, immunosuppression, modulation of autoimmune disease, potentiation of cancer immunosurveillance, or therapeutic vaccination against an established infectious disease. Treatment includes curing, amelioration, or prevention.

By “nucleic acid” is meant either a single deoxyribonucleic acid base or a ribonucleic acid or a sequence thereof joined by phosphodiester bonds.

By “therapeutic agent” is meant any molecule capable of use in treating a disease, alleviating the symptoms of a disease, preventing a disease, or diagnosing a disease. For example, a therapeutic agent may be an antigen or a drug.

By “subject” is meant an animal. The subject may be any animal, including any vertebrate. The subject may be a domestic livestock, laboratory animal (including but not limited to, rodents such as a rat, hamster, gerbil, or mouse) or pet animal. In one embodiment, the animal may be a mammal. Examples of mammals include humans, primates, marsupials, canines, monkeys, rodents, felines, apes, whales, dolphins, cows, pigs, and horses. The subject may be in need of treatment of a disease or may be in need of a prophylactic treatment.

As used herein, the term “antibody” means an immunoglobulin molecule or a fragment of an immunoglobulin molecule having the ability to specifically bind to a particular antigen. Antibodies are well known to those of ordinary skill in the science of immunology. As used herein, the term “antibody” means not only full-length antibody molecules but also fragments of antibody molecules retaining antigen binding ability. Such fragments are also well known in the art and are regularly employed both in vitro and in vivo. In particular, as used herein, the term “antibody” means not only full-length immunoglobulin molecules but also antigen binding active fragments such as the well-known active fragments F(ab′)2, Fab, Fv, and Fd.

As used herein, the term “variants” may include proteins and/or polypeptides and/or peptides that are different from a wild-type polypeptide, wherein one or more residues have been conservatively substituted with a functionally similar residue, and further which displays substantially identical functional properties of the wild-type polypeptide. Examples of conservative substitutions include substitution of one non-polar (hydrophobic) residue for another (e.g. isoleucine, valine, leucine or methionine) for another, substitution of one polar (hydrophilic) residue for another (e.g. between arginine and lysine, between glutamine and asparagine, between glycine and serine), substitution of one basic residue for another (e.g. lysine, arginine or histidine), or substitution of one acidic residue for another (e.g. aspartic acid or glutamic acid). A variant may include any polypeptide having a tertiary structure substantially identical to a polypeptide of the invention which also displays the functional properties of the polypeptides as described herein. A variant may be a mutant of a wild-type polypeptide.

As used herein “treatment” may include any type of intervention used in an attempt to alter the natural course of the individual or cell. Treatment may include, but is not limited to, administration of e.g., a pharmaceutical composition, alone or in combination with other treatment modalities generally known in the art. The “treatment” may be performed prophylactically, or subsequent to the initiation of a pathologic event.

As used herein, “pharmaceutically acceptable carrier” may include any material which, when combined with an active ingredient, allows the ingredient to retain biological activity and is non-reactive with the subject's immune system. The pharmaceutically acceptable carriers and/or excipients may include buffers, stabilizers, diluents, preservatives, and solubilizers. In general, the nature of the carrier or excipients will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e. g. powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.

As used herein, “fusion” may refer to nucleic acids and polypeptides that comprise sequences that are not found naturally associated with each other in the order or context in which they are placed according to the present invention. A fusion nucleic acid or polypeptide does not necessarily comprise the natural sequence of the nucleic acid or polypeptide in its entirety. Fusion proteins have the two or more segments joined together through normal peptide bonds. Fusion nucleic acids have the two or more segments joined together through normal phosphodiester bonds.

Isolated Polypeptides

The present invention provides the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides as set forth in SEQ ID NO: 2 and SEQ ID NO: 4, respectively, that comprises 19 repeating units of C. difficile toxin A and 23 repeating units of C. difficile toxin B. A homolog of C-TAB.G5, such as C-TAB.G5.1, may differ from C-TAB.G5 by 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acids. The C-TAB.G5.1 polypeptide is a fusion protein containing the same C-terminal domain of toxin B as C-TAB.G5, but the C-terminal domain of toxin A derived from C. difficile VPI-10463 strain which is a homolog of the according C-TAB.G5 polypeptide derived from C. difficile 630 strain and differs by two amino acids at positions 155-156. The C-TAB.G5.1 coding sequence, as set forth in SEQ ID NO: 3, was codon optimized for improved expression within an E. coli host cell. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be effective in neutralizing the toxic effects of C. difficile toxin A and toxin B.

Toxin A and toxin B are encoded by the trdA (SEQ ID NO: 5) and trdB (SEQ ID NO: 7) genes, of the C. difficile strain 630, respectively. Structurally, the C. difficile toxins comprise an ADP-glucosyl transferase domain, a cysteine protease domain, a hydrophobic region, and a receptor binding region. The C-terminal domain contains highly repetitive units (RUs) (also known as combined repetitive oligopeptides (CROPS)). The RUs may be long or short oligopeptides and may comprise 20 to 50 amino acids with a consensus YYF motif that is repeated. The RUs are grouped in clusters. As an example, toxin A, strain 630 (SEQ ID NO: 6) encoded by the wild-type trdA gene (SEQ ID NO: 5) contains 39 RUs. The 39 RUs are grouped into 8 clusters. Toxin B, strain 630 (SEQ ID NO: 8) encoded by the wild-type trdB gene (SEQ ID NO: 7) contains 24 RUs which are grouped into 5 clusters. Tables 1 and 2 below show the amino acid positions of each of the RUs in C. difficile toxin A and toxin B encoded by the trdA gene and trdB gene.

TABLE 1 Toxin A Repeating Units (ARU) AA START AA END (SEQ ID (SEQ ID CLUSTER REPEAT NO: 6) NO: 6) SEQ 1 S1 1832 1852 GLININNSLFYFDPIEFNLVT S1 1853 1873 GWQTINGKKYYFDINTGAALI S3 1874 1893 SYKIINGKHFYFNNDGVMQL L 1894 1924 GVFKGPDGFEYFAPANTQNNNIEGQAIVYQS 2 S1 1925 1944 KFLTLNGKKYYFDNNSKAVT S2 1945 1965 GWRIINNEKYYFNPNNAIAAV S3 1966 1986 GLQVIDNNKYYFNPDTAIISK S4 1987 2007 GWQTVNGSRYYFDTDTAIAFN S5 2008 2027 GYKTIDGKHFYFDSDCVVKI L 2028 2058 GVFSTSNGFEYFAPANTYNNNIEGQAIVYQS 3 S1 2059 2078 KFLTLNGKKYYFDNNSKAVT S2 2079 2099 GWQTIDSKKYYFNTNTAEAAT S3 2100 2120 GWQTIDGKKYYFNTNTAEAAT S4 2121 2141 GWQTIDGKKYYFNTNTAIAST S5 2142 2161 GYTIINGKHFYFNTDGIMQI L 2162 2192 GVFKGPNGFEYFAPANTDANNIEGQAILYQN 4 S1 2193 2212 EFLTLNGKKYYFGSDSKAVT S2 2213 2233 GWRIINNKKYYFNPNNAIAAI S3 2234 2253 HLCTINNDKYYFSYDGILQN S4 2254 2275 GYITIERNNFYFDANNESKMVT L 2276 2306 GVFKGPNGFEYFAPANTHNNNIEGQAIVYQN 5 S1 2307 2326 KFLTLNGKKYYFDNDSKAVT S2 2328 2347 GWQTIDGKKYYFNLNTAEAAT S3 2348 2368 GWQTIDGKKYYFNLNTAEAAT S4 2369 2389 GWQTIDGKKYYFNTNTFIAST S5 2390 2409 GYTSINGKHFYFNTDGIMQI L 2410 2440 GVFKGPNGFEYFAPANTDANNIEGQAILYQN 6 S1 2441 2460 KFLTLNGKKYYFGSDSKAVT S2 2461 2481 GLRTIDGKKYYFNTNTAVAVT S3 2482 2502 GWQTINGKKYYFNTNTSIAST S4 2503 2522 GYTIISGKHFYFNTDGIMQI L 2523 2553 GVFKGPDGFEYFAPANTDANNIEGQAIRYQN 7 S1 2554 2573 RFLYLHDNIYYFGNNSKAAT S1 2574 2594 GWVTIDGNRYYFEPNTAMGAN S3 2595 2613 GYKTIDNKNFYFRNGLPQI L 2614 2644 GVFKGSNGFEYFAPANTDANNIEGQAIRYQN 8 S1 2645 2664 RFLHLLGKIYYFGNNSKAVT S2 2665 2686 GWQTINGKVYYFMPDTAMAAAG S3 2687 2670 GLFEIDGVIYFFGVDGVKAPGIYG S: indicates a Short repeating unit L: indicates a Long repeating unit

TABLE 2 Toxin B Repeating Units (BRU) AA START AA END (SEQ ID NO: (SEQ ID CLUSTER REPEAT 8) NO: 8) SEQ 1 S1 1834 1854 GLIYINDSLYYFKPPVNNLIT S2 1855 1876 GFVTVGDDKYYFNPINGGAASI S3 1877 1896 GETIIDDKNYYFNQSGVLQT L 1897 1926 GVFSTEDGFKYFAPANTLDENLEGEAIDFT 2 S1 1927 1946 GKLIIDENIYYFDDNYRGAV S2 1947 1967 EWKELDGEMHYFSPETGKAFK S3 1968 1987 GLNQIGDYKYYSNSDGVMQK S4 1988 2007 GFVNINDKTFYFDDSGVMKS S5 2008 2027 GYTEIDGKHFYFAENGEMQI L 2028 2057 GVFNTEDGFKYFAHHNEDLGNEEGEEISYS 3 S1 2058 2078 GILNFNNKIYYFDDSFTAVVG S2 2079 2099 WKDLEDGSKYYFDEDTAEAYI S3 2100 2119 GLSLINDGQYYFNDDGIMQV S4 2120 2139 GFVTINDKVFYFSDSGIIES S5 2140 2159 GVQNIDDNYFYIDDNGIVQI L 2160 2189 GVFDTSDGYKYFAPANTVNDNIYGQAVEYS 4 S1 2190 2212 GLVRVGEDVYYFGETYTIETGWI S2 2213 2233 YDMENESDKYYFNPETKKACK S3 2234 2253 GINLIDDIKYYFDEKGIMRT S4 2254 2273 GLISFENNNYYFNENGEMQF S5 2274 2293 GYINIEDKMFYFGEDGVMQI L 2294 2323 GVFNTPDGFKYFAHQNTLDENFEGESINYT 5 S1 2324 2343 GWLDLDEKRYYFTDEYIAAT S2 2344 2366 GSVIIDGEEYYFDPDTAQLVISE S: indicates a Short repeating unit L: indicates a Long repeating unit

Accordingly, the C-TAB.G5 and C-TAB.G5.1 isolated polypeptides comprises 19 RUs from the C-terminal domain of C. difficile toxin A and 23 RUs from the C-terminal domain of C. difficile toxin B, respectively. The C-TAB.G5 or C-TAB.G5.1 comprises toxin A amino acids 2272-2710 of SEQ ID NO: 6 fused to toxin B amino acids 1850-2366 of SEQ ID NO: 8. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide comprises the amino acid sequence as set forth in SEQ ID NO: 2 and SEQ ID NO: 4, respectively.

The respective RUs in the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may also be from variants of C. difficile toxin A or toxin B. These RUs in the C-TAB isolated polypeptide may also be a combination of naturally occurring or variants of C. difficile toxin A or toxin B.

The RUs in the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides comprise long RUs and short RUs, and the long RUs and the short RUs are arranged into a cluster. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention comprises 4 clusters of 3 to 5 short RUs followed by one long RU of C. difficile toxin A and 5 clusters of 3 to 5 short RUs followed by one long RU of C. difficile toxin B.

The short and long RUs contain conserved motifs. The short repeating unit may comprise 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 amino acids. Each short repeating unit may comprise conserved tyrosine motifs, such as YYF, FYF, YFF, FYI, or HYF. A short repeat unit may further comprise an aspartate/histidine residue prior to the tyrosine motif if the following repeating unit is a long repeating unit. The long repeating unit may comprise 27, 28, 29, 30, 31, 32, 33, 34, or 35 amino acids. Each long repeating unit may comprise a tyrosine repeat motif such as FEYF (SEQ ID NO: 22), FKYF (SEQ ID NO: 23), or YKYF (SEQ ID NO: 24).

In the present invention, the toxin A and toxin B portions of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may be fused directly together. The toxin A and toxin B portions may be spaced apart by a linker region. A linker region may comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 to 15, 20 to 30, 40, 45, or 50 amino acids. Those skilled in the art will recognize that the linker region may be adapted to alter the positioning of the toxin A and toxin B portions so that in their expressed and folded shape each toxin repeating unit in the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides is positioned to optimally expose potential epitopes and to retain its immunogenicity. The RUs and the clusters in the C-TAB isolated polypeptides may also be separated by linkers. In one embodiment, the linker comprises the peptide RSMH (439-442 of SEQ ID NO: 2 or SEQ ID NO: 4).

The C-TAB isolated polypeptides of the present invention may have at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity or sequence similarity with SEQ ID NO: 2 or SEQ ID NO: 4. As known in the art “similarity” between two polypeptides or polynucleotides is determined by comparing the amino acid or nucleotide sequence and its conserved nucleotide or amino acid substitutes of one polynucleotide or polypeptide to the sequence of a second polynucleotide or polypeptide. Also known in the art is “identity” which means the degree of sequence relatedness between two polypeptide or two polynucleotide sequences as determined by the identity of the match between two strings of such sequences. Both identity and similarity can be readily calculated (Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). While there exist a number of methods to measure identity and similarity between two polynucleotide or polypeptide sequences, the terms “identity” and “similarity” are well known to skilled artisans (Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48: 1073 (1988). Methods commonly employed to determine identity or similarity between two sequences include, but are not limited to those disclosed in Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo, H., and Lipman, D., SIAM J. Applied Math. 48:1073 (1988).

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention are immunogenic. For example, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may have at least 50%, 60%, 70%, 80%, or 90% of the immunological activity of the corresponding bacterial toxin A, and the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may have at least 50%, 60%, 70%, 80%, or 90% of the immunological activity of the corresponding bacterial toxin B. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be used as vaccines for treating, preventing, or alleviating the symptoms of CDAD.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention also include variants of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide having SEQ ID NO: 2 or SEQ ID NO: 4, respectively. The variants may have amino acid insertions, substitutions and/or deletions that have minimal to no effect on the activity, function or shape of the isolated polypeptide. Examples of such substitutions include the substitution of one non-polar residue for another, the substitution of one polar residue for another, the substitution of one basic residue for another, or the substitution of one acidic residue for another. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide variants may further include insertions, substitutions and/or deletions of amino acids in a comparison to the amino acid sequence of the extracellular domain of native toxin A or toxin B that yield minimal effect on the activity, function and/or structure of the polypeptide. Those skilled in the art will recognize non-natural amino acids may also be used. Non-natural amino acids include, for example, beta-alanine (beta-Ala), or other omega-amino acids, such as 3-amino propionic, 2,3-diamino propionic (2,3-diaP), 4-amino butyric and so forth, alpha-aminisobutyric acid (Aib), sarcosine (Sat), ornithine (Orn), citrulline (Cit), t-butylalanine (t-BuA), t-butylglycine (t-BuG), N-methylisoleucine (N-Melle), phenylglycine (Phg), and cyclohexylalanine (Cha), norleucine (Nle), cysteic acid (Cya) 2-naphthylalanine (2-Nal); 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid (Tic); beta-2-thienylalanine (Thi); and methionine sulfoxide (MSO).

The nucleotide sequences encoding C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be codon optimized to enhance expression in varying host cells. Codon optimization refers to modifying the nucleotide sequence in order to enhance protein expression in a host cell of interest by replacing one or more codons of the native sequence with codons that are more frequently used in the genes of that host cell or in the genes of the host the cell was derived from. Various species exhibit particular bias for certain codons of a particular amino acid. The present invention provides codon-optimized nucleotide sequence encoding the C-TAB.G5.1 isolated polypeptide for enhanced expression in E. coli.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be prepared by any known techniques. For example, the isolated polypeptides may be expressed through genetic engineering. By way of example, the translation of recombinant DNA. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may also be prepared synthetically. By way of example, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may be synthesized using the solid-phase synthetic technique initially described by Merrifield (J. Am Chem. Soc. 85:2149-2154), which is incorporated herein by reference. Other polypeptide synthesis techniques may be found, for example, Kent et al. (1985) in Synthetic Peptides in Biology and Medicine, eds. Alitalo et al., Elsevier Science Publishers, 295-358.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be isolated or obtained in substantially pure form. Substantially pure means that the proteins and/or polypeptides and/or peptides are essentially free of other substances with which they may be found in nature or in vivo systems to an extent practical and appropriate for their intended use. In particular, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides are sufficiently pure and are sufficiently free from other biological constituents of their host cells so as to be useful in, for example, generating antibodies, sequencing, or producing pharmaceutical preparations. By techniques well known in the art, substantially pure polypeptides may be produced in light of the nucleic acid and amino acid sequences disclosed herein. Because a substantially purified isolated polypeptide of the invention may be admixed with a pharmaceutically acceptable carrier in a pharmaceutical preparation, the isolated polypeptide may comprise only a certain percentage by weight of the preparation. The isolated polypeptide is nonetheless substantially pure in that it has been substantially separated from the substances with which it may be associated in living systems.

The present invention further provides isolated C-TAB.G5 or C-TAB.G5.1 isolated polypeptides comprising additional polypeptides. The additional polypeptides may be fragments of a larger polypeptide. In one embodiment, there are one, two, three, four, or more additional polypeptides fused to the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In some embodiments, the additional polypeptides are fused toward the amino terminus of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In other embodiments, the additional polypeptides are fused toward the carboxyl terminus of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In further embodiments, the additional polypeptides flank the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In yet further embodiments, the additional polypeptides are dispersed between the toxin A portion and the toxin B portion of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides.

In some embodiments, the additional polypeptides aid in directing the secretion or subcellular localization of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. Such polypeptides are referred to as a “signal sequence.” A secretory signal is described, for example U.S. Pat. No. 6,291,212 and U.S. Pat. No. 5,547,871, both of which are herein incorporated by reference in their entirety. Secretory signal sequence encodes secretory peptides. A secretory peptide is an amino acid sequence that acts to direct the secretion of C-TAB.G5 or C-TAB.G5.1 from a cell. Secretory peptides are generally characterized by a core of hydrophobic amino acids and are typically (but not exclusively) found at the amino termini of newly synthesized proteins. The secretory peptide may be cleaved from C-TAB.G5 or C-TAB.G5.1 isolated polypeptide during secretion. Secretory peptides may contain processing sites that allow cleavage of the signal peptide from the mature protein as it passes through the secretory pathway. Processing sites may be encoded within the signal peptide or may be added to the signal peptide by, for example, in vitro mutagenesis. Secretory signal sequences may be required for a complex series of post-translational processing steps to allow for secretion of C-TAB.G5 or C-TAB.G5.1. The signal sequence may immediately follow the initiation codon and encodes a signal peptide at the amino-terminal end of C-TAB.G5 or C-TAB.G5.1. The signal sequence may precede the stop codon and encodes a signal peptide at the carboxy-terminal end of C-TAB.G5 or C-TAB.G5.1. In most cases, the signal sequence is cleaved off by a specific protease, called a signal peptidase. Examples of a secretory signal sequences include, but are not limited to ompA, pelB, and ST pre-pro.

In some embodiments, the additional polypeptides aid the stabilization, structure and/or the purification of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In some embodiments the additional polypeptides may comprise an epitope. In other embodiments, the additional polypeptides may comprise an affinity tag. By way of example, fusion of a polypeptide comprising an epitope and/or an affinity tag to the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may aid purification and/or identification of the polypeptide. By way of example, the polypeptide segment may be a His-tag, a myc-tag, an S-peptide tag, a MBP tag (maltose binding protein), a GST tag (glutathione S-transferase), a FLAG tag, a thioredoxin tag, a GFP tag (green fluorescent protein), a BCCP (biotin carboxyl carrier protein), a calmodulin tag, a Strep tag, an HSV-epitope tag, a V5-epitope tag, and a CBP tag. The use of such epitopes and affinity tags is known to those skilled in the art.

In further embodiments, the additional polypeptides may provide a C-TAB.G5 or C-TAB.G5.1 isolated polypeptide comprising sites for cleavage of the polypeptide. As an example, a polypeptide may be cleaved by hydrolysis of the peptide bond. In some embodiments, the cleavage is performed by an enzyme. In some embodiments, cleavage occurs in the cell. In other embodiments, cleavage occurs through artificial manipulation and/or artificial introduction of a cleaving enzyme. By way of example, cleavage enzymes may include pepsin, trypsin, chymotrypsin, thrombin, and/or Factor Xa. Cleavage allows ease of isolating the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides from the polypeptides. Cleavage may further allow for the separation of the toxin A portion from the toxin B portion. Cleavage may also allow isolation of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide fused to polypeptides from other polypeptides, such as through cleavage of an epitope utilized to purify the expressed protein.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may further possess additional structural modifications not shared with the same organically synthesized peptide, such as adenylation, carboxylation, glycosylation, hydroxylation, methylation, phosphorylation or myristylation. These added structural modifications may be further be selected or preferred by the appropriate choice of recombinant expression system. On the other hand, fusion polypeptides may have its sequence extended by the principles and practice of organic synthesis.

The present invention also provides nucleic acids encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides comprising a polypeptide portion obtained from C. difficile toxin A and a polypeptide portion obtained from C. difficile toxin B. Nucleic acids may include single or double stranded forms of deoxyribonucleotides or ribonucleotides or polymers thereof. The present invention provides ribonucleic acids encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The present invention also provides for nucleic acids that hybridize under stringent conditions to a nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and the complement thereof. Stringent conditions refer to the degree of homology between a probe and a filter-bound nucleic acid; the higher the stringency, the higher percent homology between the probe and filter bound nucleic acid. The temperature for a stringent wash may be determined based on the Tm of the nucleic acid (based on G/C content). Stringent conditions may further be affected by the concentration of salt in a buffer, such as standard sodium citrate (SSC). The present invention provides for nucleic acids having about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence similarity or sequence identity with SEQ ID NO: 1.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may further comprise a linker region, for instance a linker less than about 50, 40, 30, 20, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid residues. The linker can be covalently linked to and between the polypeptide portion derived from toxin A or portion thereof and the polypeptide portion derived from toxin B.

The present invention provides nucleic acids encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides that are degenerate to SEQ ID NO: 1 or SEQ ID NO: 3, respectively. The degeneracy of the genetic code permits variations of the nucleotide sequence of a toxin A protein, a toxin B protein and/or isolated polypeptide of interest, while still producing a polypeptide having the identical amino acid sequence as the polypeptide encoded by the native DNA sequence. The procedure, known as “codon optimization” (described in U.S. Pat. No. 5,547,871 which is incorporated herein by reference in its entirety) provides one with a means of designing such an altered DNA sequence. The design of codon optimized genes should take into account a variety of factors, including the frequency of codon usage in an organism, nearest neighbor frequencies, RNA stability, the potential for secondary structure formation, the route of synthesis and the intended future DNA manipulations of that gene. In particular, available methods may be used to alter the codons encoding a given isolated polypeptide with those most readily recognized by yeast when yeast expression systems are used, or by insect cells when the insect cell expression system is used. The degeneracy of the genetic code also permits the same amino acid sequence to be encoded and translated in many different ways. For example, leucine, serine and arginine are each encoded by six different codons, while valine, proline, threonine, alanine and glycine are each encoded by four different codons. However, the frequency of use of such synonymous codons varies from genome to genome among eukaryotes and prokaryotes. For example, synonymous codon-choice patterns among mammals are very similar, while evolutionarily distant organisms such as yeast (such as S. cerevisiae), bacteria (such as E. coli) and insects (such as D. melanogaster) reveal a clearly different pattern of genomic codon use frequencies (Grantham, R., et al., Nucl. Acid Res., 8, 49-62 (1980); Grantham, R., et al., Nucl. Acid Res., 9, 43-74 (1981); Maroyama, T., et al., Nucl. Acid Res., 14, 151-197 (1986); Aota, S., et al., Nucl. Acid Res., 16, 315-402 (1988); Wada, K., et al., Nucl. Acid Res., 19 Supp., 1981-1985 (1991); Kurland, C. G., FEBS Lett., 285, 165-169 (1991)). These differences in codon-choice patterns appear to contribute to the overall expression levels of individual genes by modulating peptide elongation rates. (Kurland, C. G., FEBS Lett., 285, 165-169 (1991); Pedersen, S., EMBO J., 3, 2895-2898 (1984); Sorensen, M. A., J. Mol. Biol., 207, 365-377 (1989); Randall, L. L., et al., Eur. J. Biochem., 107, 375-379 (1980); Curran, J. F., and Yarus, M., J. Mol. Biol., 209, 65-77 (1989); Varenne, S., et al., J. Mol. Biol., 180, 549-576 (1984), Varenne, S., et al., J. Mol, Biol., 180, 549-576 (1984); Garel, J.-P., J. Theor. Biol., 43, 211-225 (1974); Ikemura, T., J. Mol. Biol., 146, 1-21 (1981); Ikemura, T., J. Mol. Biol., 151, 389-409 (1981)).

The preferred codon usage frequencies for a synthetic gene should reflect the codon usages of nuclear genes derived from the exact (or as closely related as possible) genome of the cell/organism that is intended to be used for recombinant protein expression.

Preferred methods to determine identity are designed to give the largest match between the two sequences tested. Methods to determine identity and similarity are codified in computer programs. Preferred computer program methods to determine identity and similarity between two sequences include, but are not limited to, GCG program package (Devereux, et al., Nucl. Acid Res. 12(1):387 (1984)), BLASTP, BLASTN, FASTA (Atschul, et al., J. Mol. Biol. 215:403 (1990)). The degree of similarity or identity referred to above is determined as the degree of identity between the two sequences, often indicating a derivation of the first sequence from the second. The degree of identity between two nucleic acids may be determined by means of computer programs known in the art such as GAP provided in the GCG program package (Needleman and Wunsch J. Mol. Biol. 48:443-453 (1970)). For purposes of determining the degree of identity between two nucleic acids for the present invention, GAP is used with the following settings: GAP creation penalty of 5.0 and GAP extension penalty of 0.3.

The present invention also provides a vector comprising a nucleic acid encoding for the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. A vector may be any of a number of nucleic acids into which a desired sequence may be inserted by restriction and ligation for transport between different genetic environments or for expression in a host cell. Vectors are typically composed of DNA, although RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. A cloning vector is one which is able to replicate in a host cell, and which is further characterized by one or more endonuclease restriction sites at which the vector may be cut in a determinable fashion and into which a desired DNA sequence may be ligated such that the new recombinant vector retains its ability to replicate in the host cell. In the case of plasmids, replication of the desired sequence may occur many times as the plasmid increases in copy number within the host bacterium or just a single time per host before the host reproduces by mitosis. In the case of phage, replication may occur actively during a lytic phase or passively during a lysogenic phase.

Vectors may further contain a promoter sequence. A promoter may include an untranslated nucleic acid usually located upstream of the coding region that contains the site for initiating transcription of the nucleic acid. The promoter region may also include other elements that act as regulators of gene expression. In further embodiments of the invention, the expression vector contains an additional region to aid in selection of cells that have the expression vector incorporated. The promoter sequence is often bounded (inclusively) at its 3′ terminus by the transcription initiation site and extends upstream (5′ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site, as well as protein binding domains responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain “TATA” boxes and “CAT” boxes.

Vectors may further contain one or more marker sequences suitable for use in the identification and selection of cells which have been transformed or transfected with the vector. Markers include, for example, genes encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose activities are detectable by standard assays known in the art (e.g., β-galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or plaques. Preferred vectors are those capable of autonomous replication and expression of the structural gene products present in the DNA segments to which they are operably joined.

An expression vector is one into which a desired nucleic acid may be inserted by restriction and ligation such that it is operably joined to regulatory sequences and may be expressed as an RNA transcript. Expression refers to the transcription and/or translation of an endogenous gene, transgene or coding region in a cell.

A coding sequence and regulatory sequences are operably joined when they are covalently linked in such a way as to place the expression or transcription of the coding sequence under the influence or control of the regulatory sequences. If it is desired that the coding sequences be translated into a functional protein, two DNA sequences are said to be operably joined if induction of a promoter in the 5′ regulatory sequences results in the transcription of the coding sequence and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a promoter region would be operably joined to a coding sequence if the promoter region were capable of effecting transcription of that DNA sequence such that the resulting transcript might be translated into the desired protein or polypeptide.

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention may be produced by expressing the encoding nucleic acid in host cells. The nucleic acid may be transformed or transfected into host cells. Accordingly, some aspects of the present invention include the transformation and/or transfection of nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. Transformation is the introduction of exogenous or heterologous nucleic acid to the interior of a prokaryotic cell. Transfection is the introduction of exogenous or heterologous nucleic acid to the interior of a eukaryotic cell. The transforming or transfecting nucleic acid may or may not be integrated (covalently linked) into chromosomal DNA making up the genome of the cell. In prokaryotes, for example, the transforming nucleic acid may be maintained on an episomal element such as a plasmid or viral vector. With respect to eukaryotic cells, a stably transfected cell is one in which the transfecting nucleic acid has become integrated into a chromosome so that it is inherited by daughter cells through chromosome replication. This stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones comprised of a population of daughter cells containing the transfected nucleic acid.

Higher eukaryotic cell cultures may be used to express the proteins of the present invention, whether from vertebrate or invertebrate cells, including insects, and the procedures of propagation thereof are known (see, for example, Kruse et al. (1973) Tissue Culture, Academic Press).

Host cells and vectors for replicating the nucleic acids and for expressing the encoded C-TAB.G5 or C-TAB.G5.1 isolated polypeptides are also provided. Any vectors or host cells may be used, whether prokaryotic or eukaryotic. Many vectors and host cells are known in the art for such purposes. It is well within the skill of the art to select an appropriate set for the desired application.

DNA sequences encoding toxin A and toxin B, or portions thereof may be cloned from a variety of genomic or cDNA libraries derived from C. difficile and other known toxin A and toxin B expressing prokaryotes known in the art. The techniques for isolating such DNA sequences using probe-based methods are conventional techniques and are well known to those skilled in the art. Probes for isolating such DNA sequences may be based on published DNA or protein sequences. Alternatively, the polymerase chain reaction (PCR) method disclosed by Mullis et al. (U.S. Pat. No. 4,683,195) and Mullis (U.S. Pat. No. 4,683,202), incorporated herein by reference may be used. The choice of library and selection of probes for the isolation of such DNA sequences is within the level of ordinary skill in the art.

Suitable host cells maybe derived from prokaryotes or eukaryotes. Suitable prokaryote hosts include: Pseudomonas such as P. aeruginosa, Escherichia coli, Staphylococcus such as S. aureus and S. epidermidis, Serratia marcescens, Bacillus such as B. subtillis and B. megaterium, Clostridium sporogenes, Enterococcus faecalis, Micrococcus such as M. luteus and M. roseus, and Proteus vulgaris. Suitable host cells for expressing the polypeptides of the present invention in higher eukaryotes include: yeasts such as Saccharomyces (e.g. S. cerevisiae); 293 (human embryonic kidney) (ATCC CRL-1573); 293F (Invitrogen, Carlsbad Calif.); 293T and variant 293T/17 (293tsA1609neo and variant ATCC CRL-11268) (human embryonic kidney transformed by SV40 T antigen); COS-7 (monkey kidney CVI line transformed by SV40)(ATCC CRL1651); BHK (baby hamster kidney cells) (ATCC CRL10); CHO (Chinese hamster ovary cells); mouse Sertoli cells; CVI (monkey kidney cells) (ATCC CCL70); VERO76 (African green monkey kidney cells) (ATCC CRL1587); HeLa (human cervical carcinoma cells) (ATCC CCL2); MDCK (canine kidney cells) (ATCC CCL34); BRL3A (buffalo rat liver cells) (ATCC CRL1442); W138 (human lung cells) (ATCC CCL75); HepG2 (human liver cells) (HB8065); and MMT 060652 (mouse mammary tumor) (ATCC CCL51).

In other embodiments, the present invention provides nucleic acids encoding an isolated polypeptide comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides and additional polypeptides. Vectors useful for constructing eukaryotic expression systems for the production of fusion polypeptides comprise nucleic acid encoding the isolated polypeptide operatively linked to an appropriate transcriptional activation sequence, such as a promoter and/or operator. Other typical features may include appropriate ribosome binding sites, termination codons, enhancers, terminators, or replicon elements. These additional features can be inserted into the vector at the appropriate site or sites by conventional splicing techniques such as restriction endonuclease digestion and ligation.

In some embodiments, additional nucleic acids may be fused to the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The fused nucleic acid may encode polypeptides that may aid in purification and/or immunogenicity and/or stability without shifting the codon reading frame of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The fused nucleic acids may encode a secretory sequence, that may or may not be cleaved from the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The fused nucleic acids may not elongate the expressed polypeptide significantly. The fused nucleic acids may encode for less than sixty extra amino acids to the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In some embodiments, the fused nucleic acids follow after the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In other embodiments, the fused nucleic acids precede the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In other embodiments, the fused nucleic acids flank the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides.

In some embodiments, the fused nucleic acids may encode for a polypeptide to aid purification of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. In some embodiments the fused nucleic acid will encode for an epitope and/or an affinity tag. Examples of polypeptides that aid purification include, but are not limited to, a His-tag, a myc-tag, an S-peptide tag, a MBP tag, a GST tag, a FLAG tag, a thioredoxin tag, a GFP tag, a BCCP, a calmodulin tag, a Strep tag, an HSV-epitope tag, a V5-epitope tag, and a CBP tag. In other embodiments, the fused nucleic acid may encode for a C-TAB.G5 or C-TAB.G5.1 isolated polypeptide that has a site directed for, or prone to, cleavage. In one embodiment, the fused nucleic acid may encode for polypeptides comprising sites of enzymatic cleavage. In further embodiments, the enzymatic cleavage may aid in isolating the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides, as well as other fused polypeptide segments, from yet other polypeptides. By way of example, an intermediary nucleic acid that encodes for an enzymatic cleavage site placed between nucleic acids that encode for C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and an epitope may allow for later separation of the expressed C-TAB.G5 or C-TAB.G5.1 isolated polypeptides and the epitope. Such sites may also be present between the toxin A portion and the toxin B portion.

The present invention also provides for expression systems designed to assist in expressing and providing the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The expression system may comprise a host cell transformed or transfected with a nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The host cell may be a prokaryote. The prokaryote may be E. coli. The host cell may be an eukaryotic cell.

The expression system may further comprise agents to aid in selection of host cells successfully transformed or transfected with a nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. For example, the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may further express a gene to assist the host cell in resistance to antibiotics, such as genes to resist kanamycin or gentamycin or ampicillin or penicillin. Such resistant genes will allow for selection of host cells that have properly incorporated the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, as is known to those skilled in the art.

Another aspect of the invention is directed to the generation of antibodies. Examples of antibodies encompassed by the present invention, include, but are not limited to, antibodies produced by immunizing a subject with the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. Antibodies generated by immunizing with the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may bind specifically to toxin A or toxin B, or they may cross react with the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The antibodies produced by the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention may be characterized using methods well known in the art.

The antibodies produced by using the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention can encompass monoclonal antibodies, polyclonal antibodies, antibody fragments (e.g., Fab, Fab′, F(ab′)2, Fv, Fc, etc.), chimeric antibodies, bispecific antibodies, heavy chain only antibodies, heteroconjugate antibodies, single chain (ScFv), single domain antibodies, variants thereof, isolated polypeptides comprising an antibody portion, humanized antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity, including glycosylation variants of antibodies, amino acid sequence variants of antibodies, and covalently modified antibodies. Preferred antibodies are derived from murine, rat, human, rabbit, canine, porcine, dromedary, camel, llama, feline, primate, or any other origin (including chimeric, fragment and/or humanized antibodies).

In other embodiments, the antibodies produced by immunizing with the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide are then humanized by methods known in the art. A humanized antibody is an immunoglobulin molecule that contains minimal sequence derived from non-human immunoglobulin. In yet other embodiments, fully human antibodies are obtained by using commercially available mice that have been engineered to express specific human immunoglobulin proteins. In other embodiments, the antibodies are chimeric. A chimeric antibody is an antibody that combines characteristics from two different antibodies. Methods of preparing chimeric antibodies are known in the art.

In other embodiments, the nucleotide sequence that encodes the antibodies is obtained and then cloned into a vector for expression or propagation. In another embodiment, antibodies are made recombinantly and expressed using methods known in the art. By way of example, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be used as an antigen for the purposes of isolating recombinant antibodies by these techniques. Antibodies can be made recombinantly by using the gene sequence to express the antibody recombinantly in host cells. Methods for making variants of antibodies and recombinant antibodies are known in the art.

In other embodiments, the antibodies are bound to a carrier by conventional methods in the art, for use in, for example, isolating or purifying native toxin A or toxin B or detecting native toxin A or toxin B or C. difficile in a biological sample or specimen.

Compositions and Formulations

The present invention also provides compositions comprising C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The compositions may be pharmaceutical compositions comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and a pharmaceutically acceptable carrier. The compositions used in the methods of the invention generally comprise, by way of example and not limitation, and effective amount of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide (e.g., an amount sufficient to induce an immune response) of the invention or an antibody against the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides (e.g., an amount of a neutralizing antibody sufficient to mitigate infection, alleviate a symptom of infection and/or prevent infection). The pharmaceutical composition may further comprise pharmaceutically acceptable carriers, excipients, or stabilizers known in the art (see generally Remington, (2005) The Science and Practice of Pharmacy, Lippincott, Williams and Wilkins).

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the invention may be used for methods for immunizing or treating humans and/or animals with the CDAD. Therefore, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may be used within a pharmaceutical composition. The pharmaceutical composition of the present invention may further encompass pharmaceutically acceptable carriers and/or excipients. The pharmaceutically acceptable carriers and/or excipients useful in this invention are conventional and may include buffers, stabilizers, diluents, preservatives, and solubilizers. Remington's Pharmaceutical Sciences, by E. W. Martin, Mack Publishing Co., Easton, Pa., 15th Edition (1975), describes compositions and formulations suitable for pharmaceutical delivery of the polypeptides herein disclosed. In general, the nature of the carrier or excipients will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e. g. powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.

In one embodiment the pharmaceutical composition may further comprise an immunostimulatory substance, such as an adjuvant. The adjuvant can be selected based on the method of administration and may include mineral oil-based adjuvants such as Freund's complete and incomplete adjuvant, Montanide incomplete Seppic adjuvant such as ISA, oil in water emulsion adjuvants such as the Ribi adjuvant system, syntax adjuvant formulation containing muramyl dipeptide, aluminum hydroxide or aluminum salt adjuvant (alum), polycationic polymer, especially polycationic peptide, especially polyarginine or a peptide containing at least two LysLeuLys motifs, especially KLKLLLLLKLK (SEQ ID NO: 21), immunostimulatory oligodeoxynucleotide (ODN) containing non-methylated cytosine-guanine dinucleotides (CpG) in a defined base context (e.g. as described in WO 96/02555) or ODNs based on inosine and cytidine (e.g. as described in WO 01/93903), or deoxynucleic acid containing deoxy-inosine and/or deoxyuridine residues (as described in WO 01/93905 and WO 02/095027), especially Oligo(dIdC)₁₃ (as described in WO 01/93903 and WO 01/93905), neuroactive compound, especially human growth hormone (described in WO 01/24822), or combinations thereof. Such combinations are according to the ones e.g. described in WO 01/93905, WO 02/32451, WO 01/54720, WO 01/93903, WO 02/13857, WO 02/095027 and WO 03/047602. Preferably, the adjuvant is aluminum hydroxide adjuvant.

Acceptable carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations that are administered. Carriers, excipients or stabilizers may further comprise buffers. Examples of excipients include, but are not limited to, carbohydrates (such as monosaccharide and disaccharide), sugars (such as sucrose, mannitol, and sorbitol), phosphate, citrate, antioxidants (such as ascorbic acid and methionine), preservatives (such as phenol, butanol, benzanol; alkyl parabens, catechol, octadecyldimethylbenzyl ammonium chloride, hexamethoniuni chloride, resorcinol, cyclohexanol, 3-pentanol, benzalkonium chloride, benzethonium chloride, and m-cresol), low molecular weight polypeptides, proteins (such as serum albumin or immunoglobulins), hydrophilic polymers amino acids, chelating agents (such as EDTA), salt-forming counter-ions, metal complexes (such as Zn-protein complexes), and non-ionic surfactants (such as TWEEN™ and polyethylene glycol).

The pharmaceutical composition of the present invention may further comprise additional agents that serve to enhance and/or complement the desired effect. By way of example, to enhance the immunogenicity the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the invention being administered as a subunit vaccine, the pharmaceutical composition may further comprise an adjuvant.

An example of a pharmaceutical composition may be an immunogenic composition. The present invention provides immunogenic compositions comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptides. The immunogenic composition may further include a pharmaceutically acceptable carrier or other carriers and/or excipients in a formulation suitable for injection in a mammal. An immunogenic composition is any composition of material that elicits an immune response in a mammalian host when the immunogenic composition is injected or otherwise introduced. The immune response may be humoral, cellular, or both. A booster effect refers to an increased immune response to an immunogenic composition upon subsequent exposure of the mammalian host to the same immunogenic composition. A humoral response results in the production of antibodies by the mammalian host upon exposure to the immunogenic composition.

The immunogenic compositions of the present invention elicit an immune response in a mammalian host, including humans and other animals. The immune response may be either a cellular dependent response or an antibody dependent response or both; and further the response may provide immunological memory or a booster effect or both in the mammalian host. These immunogenic compositions are useful as vaccines and may provide a protective response by the mammalian subject or host to infection by strains of C. difficile.

The present invention further includes methods for producing an immunogenic composition by constructing the nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and expressing C-TAB.G5 or C-TAB.G5.1 isolated polypeptide component in a microbial host; recovering the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide from a culture of the host; conjugating the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide to a second protein component, and recovering the conjugated protein and polysaccharide component. The nucleic acid encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be maintained throughout the growth of the host by constant and stable selective pressure. Maintenance of the expression vector may be conferred by incorporation in the expression vector of a genetic sequence that encodes a selective genotype, the expression of which in the microbial host cell results in a selective phenotype. A selective genotype sequence may also include a gene complementing a conditional lethal mutation. Other genetic sequences may be incorporated in the expression vector, such as other drug resistance genes or genes that complement lethal mutations. Microbial hosts may include: Gram positive bacteria; Gram negative bacteria, such as E. coli; yeasts; filamentous fungi; mammalian cells; insect cells; or plant cells.

The methods of the present invention also provide for a level of expression of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide in the host at a level greater than about 50 mg/liter of the culture, a level greater than about 100 mg/liter, a level greater than about 500 mg/liter, or a level greater than about 1 g/liter. This invention also provides that the protein may be recovered by any number of methods known to those in the art for the isolation and recovery of proteins, such as by ammonium sulfate precipitation followed by ion exchange chromatography.

The present invention further includes methods for preparing the immunogenic composition that provides that the protein component is conjugated to a second protein component by one of a number of means known to those in the art, such as an amidization reaction.

The present invention also provides formulations comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide for treating and preventing CDAD. In one embodiment, the formulation may include the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention, an adjuvant, and a pharmaceutically acceptable carrier. In another embodiment, the formulation includes the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention, or consists essentially of one or more C-TAB.G5 or C-TAB.G5.1 isolated polypeptides of the present invention. The formulation may comprise the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention and an adjuvant. The formulation may further include an additional antigen or a drug. Moreover, the formulation may include one or more drugs and may in addition to the isolated polypeptide and/or adjuvant include one or more drugs.

The formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be in liquid or dry form. A dry formulation may be easily stored and transported. Dry formulations break the cold chain required from the vaccine's place of manufacture to the locale where vaccination occurs. Alternatively, the dry, active ingredient of the formulation per se may be an improvement by providing a solid particulate form that is taken up and processed by antigen presenting cells. These possible mechanisms are discussed not to limit the scope of the invention or its equivalents, but to provide insight into the operation of the invention and to guide the use of this formulation in immunization and vaccination.

Dry formulations of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be provided in various forms: for example, fine or granulated powders, lyophilized powder, uniform films, pellets, and tablets. It may be air dried, dried with elevated temperature, lyophilized, freeze or spray dried, coated or sprayed on a solid substrate and then dried, dusted on a solid substrate, quickly frozen and then slowly dried under vacuum, or combinations thereof. If different molecules are active ingredients of the formulation, they may be mixed in solution and then dried, or mixed in dry form only.

Formulations comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide in liquid or solid form, such as a dry form, may be applied with one or more adjuvants at the same or separate sites or simultaneously or in frequent, repeated applications. The formulation may include other antigens such that administration of the formulation induces an immune response to multiple antigens. In such a case, the other antigens may have different chemical structures so as to induce an immune response specific for different antigens. At least one antigen and/or adjuvant may be maintained in dry form prior to administration. Subsequent release of liquid from a reservoir or entry of liquid into a reservoir containing the dry ingredient of the formulation will at least partially dissolve that ingredient.

Solids (e.g., particles of nanometer or micrometer dimensions) may also be incorporated in the formulation. Solid forms (e.g., nanoparticles or microparticles) may aid in dispersion or solubilization of active ingredients; provide a point of attachment for adjuvant, C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, or both to a substrate that can be opsonized by antigen presenting cells, or combinations thereof. Prolonged release of the formulation from a porous solid formed as a sheet, rod, or bead acts as a depot.

At least one ingredient or component of the formulation (i.e., C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, adjuvant, or drug) may be provided in dry form prior to administration of the formulation. This formulation may also be used in conjunction with conventional enteral, mucosal, or parenteral immunization techniques.

The formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be manufactured under aseptic conditions acceptable to appropriate regulatory agencies (e.g., Food and Drug Administration, EMEA for biologicals and vaccines. Optionally, components such as desiccants, excipients, stabilizers, humectants, preservatives, or combinations thereof may be included in the formulation even though they are immunologically inactive. They may, however, have other desirable properties or characteristics.

Processes for manufacturing a pharmaceutical formulation are well known. The components of the formulation may be combined with a pharmaceutically-acceptable carrier or vehicle, as well as any combination of optional additives (e.g., diluents, binders, excipients, stabilizers, desiccants, preservatives, colorings). The use of solid carriers, and the addition of excipients to assist in solubilization of dry components or stabilizers of immunogenic or adjuvant activity, are preferred embodiments. See, generally, Ullmann's Encyclopedia of Industrial Chemistry, 6^(th) Ed. (electronic edition, 2003); Remington's Pharmaceutical Sciences, 22^(nd) (Gennaro, 2005, Mack Publishing); Pharmaceutical Dosage Forms, 2^(nd) Ed. (various editors, 1989-1998, Marcel Dekker); and Pharmaceutical Dosage Forms and Drug Delivery Systems (Ansel et al., 2005, Williams & Wilkins).

Good manufacturing practices are known in the pharmaceutical industry and regulated by government agencies (e.g., Food and Drug Administration, EMEA. Sterile liquid formulations may be prepared by dissolving an intended component of the formulation in a sufficient amount of an appropriate solvent, followed by sterilization by filtration to remove contaminating microbes. Generally, dispersions are prepared by incorporating the various sterilized components of the formulation into a sterile vehicle which contains the basic dispersion medium. For production of solid forms that are required to be sterile, vacuum drying or freeze drying can be used.

In general, solid dosage forms (e.g., powders, granules, pellets, tablets) can be made from at least one active ingredient or component of the formulation.

Suitable tableting procedures are known. The formulation may also be produced by encapsulating solid forms of at least one active ingredient, or keeping them separate from liquids in compartments or chambers. The size of each dose and the interval of dosing to the subject may be used to determine a suitable size and shape of the tablet, capsule, compartment, or chamber.

Formulations will contain an effective amount of the active ingredients (e.g., drug, antigen and adjuvant) together with carrier or suitable amounts of vehicle in order to provide pharmaceutically-acceptable compositions suitable for administration to a human or animal.

The relative amounts of active ingredients, such as amounts of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, within a dose and the dosing schedule may be adjusted appropriately for efficacious administration to a subject (e.g., animal or human). This adjustment may also depend on the subject's particular disease or condition, and whether treatment or prophylaxis is intended. To simplify administration of the formulation to the subject, each unit dose contains the active ingredients in predetermined amounts for a single round of immunization.

There are numerous causes of polypeptide instability or degradation, including hydrolysis and denaturation. In the case of denaturation, the conformation or three-dimensional structure of the protein is disturbed and the protein unfolds from its usual globular structure. Rather than refolding to its natural conformation, hydrophobic interaction may cause clumping of molecules together (i.e., aggregation) or refolding to an unnatural conformation. Either of these results may entail diminution or loss of immunogenic or adjuvant activity. Stabilizers may be added to lessen or prevent such problems.

The formulation, or any intermediate in its production, may be pretreated with protective agents (i.e., cryoprotectants and dry stabilizers) and then subjected to cooling rates and final temperatures that minimize ice crystal formation. By proper selection of cryoprotective agents and use of pre-selected drying parameters, almost any formulation might be cryoprepared for a suitable desired end use.

It should be understood in the following discussion of optional additives like excipients, stabilizers, desiccants, and preservatives are described by their function. Thus, a particular chemical may act as some combination of recipient, stabilizer, desiccant, and/or preservative. Such chemical would be immunologically-inactive because it does not directly induce an immune response, but it increases the response by enhancing immunological activity of the antigen or adjuvant: for example, by reducing modification of the antigen or adjuvant, or denaturation during drying and dissolving cycles.

Stabilizers include cyclodextrin and variants thereof (see U.S. Pat. No. 5,730,969). Suitable preservatives such as sucrose, mannitol, sorbitol, trehalose, dextran, and glycerin can also be added to stabilize the final formulation (Howell and Miller, 1983). A stabilizer selected from nonionic surfactants, D-glucose, D-galactose, D-xylose, D-glucuronic acid, salts of D-glucuronic acid, trehalose, dextrans, hydroxyethyl starches, and mixtures thereof may be added to the formulation. Addition of an alkali metal salt or magnesium chloride may stabilize the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, optionally including serum albumin and freeze-drying to further enhance stability. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may also be stabilized by contacting it with a saccharide selected from the group consisting of dextran, chondroitin sulfuric acid, starch, glycogen, insulin, dextrin, and alginic acid salt. Other sugars that can be added include monosaccharides, disaccharides, sugar alcohols, and mixtures thereof (e.g., glucose, mannose, galactose, fructose, sucrose, maltose, lactose, mannitol, xylitol). Polyols may stabilize a polypeptide, and are water-miscible or water-soluble. Suitable polyols may be polyhydroxy alcohols, monosaccharides and disaccharides including mannitol, glycerol, ethylene glycol, propylene glycol, trimethyl glycol, vinyl pyrrolidone, glucose, fructose, arabinose, mannose, maltose, sucrose, and polymers thereof. Various excipients may also stabilize polypeptides, including serum albumin, amino acids, heparin, fatty acids and phospholipids, surfactants, metals, polyols, reducing agents, metal chelating agents, polyvinyl pyrrolidone, hydrolyzed gelatin, and ammonium sulfate.

As an example, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide formulation can be stabilized in sucrose, trehalose, poly(lactic acid) (PLA) and poly(lactide-co-glycolide) (PLGA) microspheres by suitable choice of excipient or stabilizer (Sanchez et al., 1999). Sucrose, or trehalose may be advantageously used as an additive because it is a non-reducing saccharide, and therefore does not cause aminocarbonyl reactions with substances bearing amino groups such as proteins. Sucrose or trehalose may be combined with other stabilizers such as saccharides.

Additionally, the formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may include therapeutic agents, such as e.g. anesthetics, analgesics, anti-inflammatories, steroids, antibiotics, antiarthritics, anorectics, antihistamines, and antineoplastics. Examples of such therapeutic agents include lidocaine and nonsteroidal anti-inflammatory drugs (NSAID). In another embodiment, the therapeutic agents are antigens and adjuvants. In still another embodiment, the formulation comprising antigen and/or adjuvant may be applied separately but along with other therapeutic agents, such e.g anesthetics, analgesics, anti-inflammatories, steroids, antibiotics, antiarthritics, anorectics, antihistamines, and antineoplastics. In a preferred embodiment, the antibiotics are fidaxomicin, metronidazole or vancomycin.

The formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be delivered via various routes of administration such as e.g. intramuscularly.

Polymers may be added to the formulation and may act as an excipient, stabilizer, and/or preservative of an active ingredient as well as reducing the concentration of the active ingredient that saturates a solution used to dissolve the dry form of the active ingredient. Such reduction occurs because the polymer reduces the effective volume of the solution by filling the “empty” space. Thus, quantities of antigen/adjuvant can be conserved without reducing the amount of saturated solution. An important thermodynamic consideration is that an active ingredient in the saturated solution will be “driven” into regions of lower concentration. In solution, polymers can also stabilize and/or preserve the antigen/adjuvant-activity of solubilized ingredients of the formulation. Such polymers include ethylene or propylene glycol, vinyl pyrrolidone, and 0-cyclodextrin polymers and copolymers.

A single or unit dose of the formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide suitable for administration is provided. The amount of adjuvant and/or C-TAB.G5 or C-TAB.G5.1 isolated polypeptide in the unit dose may be anywhere in a broad range from about 0.001 μg to about 10 mg. This range may be from about 0.1 ug to about 1 mg; a narrower range is from about 5 μg to about 500 μg. Other suitable ranges are between about 20 μg to about 200 μg, such as e.g. about 20 μg, about 75 μg or about 200 μg. A preferred dose for a C-TAB.G5 or C-TAB.G5.1 isolated polypeptide is from about 20 μg or 200 μg or less. The ratio between C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and adjuvant may be about 1:1 or about 1:1.25, but higher ratios may also be used (e.g., about 1:10 or less), or lower ratios of C-TAB isolated polypeptide to adjuvant may also be used (e.g., about 10:1 or more).

The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be used as an antigen and may be presented to immune cells, and an antigen-specific immune response is induced. This may occur before, during, or after infection by a pathogen, such as C. difficile. Only C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be required, but no additional adjuvant, if the immunogenicity of the formulation is sufficient to not require adjuvant activity. The formulation may include an additional antigen such that application of the formulation induces an immune response against multiple antigens (i.e., multivalent). Antigen-specific lymphocytes may participate in the immune response and, in the case of participation by B lymphocytes, antigen-specific antibodies may be part of the immune response. The formulations described above may include desiccants, excipients, humectants, stabilizers, and preservatives known in the art.

The formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention may be used to treat a subject (e.g., a human or animal in need of treatment such as prevention of disease, protection from effects of infection, reducing or alleviating the symptoms of a disease, such as CDAD, or combinations thereof). E.g. the formulation comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide of the present invention may be used to treat a subject at risk of CDAD, such as e.g. a subject with the following profile: i) a subject with a weaker immune system such as e.g. an elderly subject (e.g. a subject above 65 years of age) or a subject below 2 years of age; ii) an immunocompromised subject such as e.g. a subject with AIDS; iii) a subject taking or planning to take immunosuppressing drugs; iv) a subject with planned hospitalization or a subject that is in hospital; v) a subject in or expected to go to an intensive care unit (ICU); vi) a subject that is undergoing or is planning to undergo gastrointestinal surgery; vii) a subject that is in or planning to go to a long-term care such as a nursing home; viii) a subject with co-morbidities requiring frequent and/or prolonged antibiotic use; ix) a subject that is a subject with two or more of the above mentioned profiles, such as e.g. an elderly subject that is planning to undergo a gastrointestinal surgery; x) a subject with inflammatory bowel disease; and/or xi) a subject with recurrent CDAD such as e.g. a subject having experienced one or more episodes of CDAD.

The treatment may vaccinate the subject against infection by the pathogen or against its pathogenic effects such as those caused by toxin secretion. The formulation may be used therapeutically to treat existing disease, protectively to prevent disease, to reduce the severity and/or duration of disease, to ameliorate symptoms of disease, or combinations thereof.

The formulations comprising C-TAB.G5 or C-TAB.G5.1 isolated polypeptides may be delivered by various routes of administration including but not limited to oral, subcutaneous, intradermal, intravenous, intra-arterial, intramuscular, intracardial, intraspinal, intrathoracical, intraperitoneal, intraventricular, and/or sublingual routes.

The formulation may also comprise one or more adjuvants or combinations of adjuvants. Usually, the adjuvant and the formulation are mixed prior to presentation of the antigen but, alternatively, they may be separately presented within a short interval of time.

Adjuvants include, for example, an oil emulsion (e.g., complete or incomplete Freund's adjuvant), Montanide incomplete Seppic adjuvant such as ISA, oil in water emulsion adjuvants such as the Ribi adjuvant system, syntax adjuvant formulation containing muramyl dipeptide, aluminum hydroxide or salt adjuvant (ALUM), polycationic polymer, especially polycationic peptide, especially polyarginine or a peptide containing at least two LysLeuLys motifs, especially KLKLLLLLKLK (SEQ ID NO: 21), immunostimulatory oligodeoxynucleotide (ODN) containing non-methylated cytosine-guanine dinucleotides (CpG) in a defined base context (e.g. as described in WO 96/02555) or ODNs based on inosine and cytidine (e.g. as described in WO 01/93903), or deoxynucleic acid containing deoxy-inosine and/or deoxyuridine residues (as described in WO 01/93905 and WO 02/095027), especially Oligo(dIdC)₁₃ (as described in WO 01/93903 and WO 01/93905), neuroactive compound, especially human growth hormone (described in WO 01/24822), or combinations thereof, a chemokine (e.g., defensins 1 or 2, RANTES, MIP1-α, MIP-2, interleukin-8, or a cytokine (e.g., interleukin-1β, -2, -6, -10 or -12; interferon-γ; tumor necrosis factor-α; or granulocyte-monocyte-colony stimulating factor) (reviewed in Nohria and Rubin, 1994), a muramyl dipeptide variant (e.g., murabutide, threonyl-MDP or muramyl tripeptide), synthetic variants of MDP, a heat shock protein or a variant, a variant of Leishmania major LeIF (Skeiky et al., 1995), non-toxic variants of bacterial ADP-ribosylating exotoxins (bAREs) including variants at the trypsin cleavage site (Dickenson and Clements, 1995) and/or affecting ADP-ribosylation (Douce et al., 1997), or chemically detoxified bAREs (toxoids), QS21, Quill A, N-acetylmuramyl-L-alanyl-D-isoglutamyl-L-alanine-2-[1,2-dipalmitoyl-s-glycero-3-(hydroxyphosphoryloxy)]ethylamide (MTP-PE) and compositions containing a metabolizable oil and an emulsifying agent, wherein the oil and emulsifying agent are present in the form of an oil-in-water emulsion having oil droplets substantially all of which are less than one micron in diameter (see, for example, EP 0399843). Also, see Richards et al. (1995) for other adjuvants useful in immunization.

An adjuvant may be chosen to preferentially induce antibody or cellular effectors, specific antibody isotypes (e.g., IgM, IgD, IgA1, IgA2, secretory IgA, IgE, IgG1, IgG2, IgG3, and/or IgG4), or specific T-cell subsets (e.g., CTL, Th1, Th2 and/or T_(DTH)) (see, for example, Munoz et al., 1990; Glenn et al., 1995).

Unmethylated CpG dinucleotides or motifs are known to activate B cells and macrophages (Stacey et al., 1996). Other forms of DNA can be used as adjuvants. Bacterial DNAs are among a class of structures which have patterns allowing the immune system to recognize their pathogenic origins to stimulate the innate immune response leading to adaptive immune responses (Medzhitov and Janeway, 1997, Curr. Opin. Immunol. 9(1): 4-9). These structures are called pathogen-associated molecular patterns (PAMPs) and include lipopolysaccharides, teichoic acids, unmethylated CpG motifs, double-stranded RNA, and mannins. PAMPs induce endogenous signals that can mediate the inflammatory response, act as co-stimulators of T-cell function and control the effector function. The ability of PAMPs to induce these responses play a role in their potential as adjuvants and their targets are APCs such as macrophages and dendritic cells. PAMPs could also be used in conjunction with other adjuvants to induce different co-stimulatory molecules and control different effector functions to guide the immune response, for example from a Th2 to a Th1 response.

Other aspects of the invention is directed toward use of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide as vaccinating agent. The vaccines or immunogenic compositions of the present invention may employ an effective amount of the antigen. There will be included an amount of antigen which will cause the subject to produce a specific and sufficient immunological response so as to impart protection to the subject from subsequent exposure to C. difficile. The antigen may be the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. In one embodiment, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide is administered by itself or in combination with an adjuvant.

Another aspect of the invention includes use of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide as a subunit vaccine. A subunit vaccine refers to the use of a fragment of a pathogen as an inoculating agent. Those skilled in the art will know subunit vaccines offer a means to generate antibodies to a particular part or region of a pathogen.

Dosage schedule of administration and efficacy of the vaccine can be determined by methods known in the art. The amount of the vaccine and the immunization regimen may depend on the particular antigen and the adjuvant employed, the mode and frequency of administration, and the desired effect (e.g., protection and/or treatment). In general, the vaccine of the invention may be administered in amounts ranging between 1 μg and 100 mg, such as e.g. between 60 μg and 600 μg. A single dose of the vaccine comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be in a range from about 1 μg to about 1 mg, preferably from about 5 μg to about 500 μg, more preferably from about 20 μg to about 200 μg. The ratio between C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and adjuvant such as alum may be about 1:1 such as e.g. 1:1.25, but higher ratios may also be used (e.g., about 1:10 or less), or lower ratios may also be used (e.g., about 10:1 or more). In an embodiment, in the vaccine comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide the adjuvant aluminum hydroxide will be used in a range from about 50 μg/mL to about 200 μg/mL, preferably in the amount about 125 μg/mL of the final formulation.

The vaccine comprising the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide can be administered orally, intravenously, subcutaneously, intra-arterially, intramuscularly, intracardially, intraspinally, intrathoracically, intraperitoneally, intraventricularly, and/or sublingually.

The immunization regimen can be determined by methods known in the art. Administration of the vaccine can be repeated as is determined to be necessary by one skilled in the art. For example, a priming dose may be followed by 1, 2, 3 or more booster doses at weekly, bi-weekly or monthly intervals. In an embodiment of the present invention, the priming dose is followed by one or two booster administration in intervals from about 7 to about 14 days such as e.g. after 7 days and 21 days after first prime. In a preferred embodiment, the therapeutically effective amount of the vaccine is administered two or three times in intervals of 14 days +/−1, 2 or 3 days (bi-weekly) to a subject. In an embodiment of the present invention, the therapeutically effective amount of the vaccine is administered once.

Still another aspect is directed to the population which can be treated according to the present invention. In one embodiment, the population includes healthy individuals who are at risk of exposure to C. difficile, especially, the individuals impending hospitalization or residence in a care facility, as well as personals in hospitals, nursing homes and other care facilities. In another embodiment, the population includes previously infected patients who relapsed after discontinuation of antibiotic treatment, or patients for whom antibiotic treatment is not efficient.

In one more embodiment of the invention, the population includes individuals who are at least 18 years or more of age. In one preferred embodiment, the human subject is from 18 to 65 years old. In another preferred embodiment, the human subject is elderly individuals over 65 years of age. The latter age group being the most vulnerable population suffering from C. difficile infections. In some more embodiment, the human subject is younger than 18 years of age.

Methods of Using the C-TAB.G5 or C-TAB.G5.1 Isolated Polypeptide

The present invention also provides methods of using the isolated polypeptide. For example, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide may be used to prevent or treat diseases associated with C. difficile. By way of example, introducing the isolated polypeptides of the present invention into the immune system of a subject may induce an immune response that includes the subject producing antibodies directed against the isolated polypeptide. Such antibodies are useful for recognizing C. difficile.

The present invention provides methods of delivering isolated polypeptides to a subject comprising administering the isolated polypeptide to a subject. The isolated polypeptide may be administered as a liquid or as a solid. The isolated polypeptide may further include a pharmaceutically acceptable carrier.

The present invention also provides methods for identifying and isolating variable domains of an antibody that recognize and bind to toxin A and or toxin B comprising use of the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide to produce an immune response, purifying and then characterizing the antibodies produced in response to the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. Identified epitopes may be of use for cloning further antibodies or fragments thereof.

One aspect of the present invention is directed in part to the treatment, the prevention, and the detection of C. difficile. In some embodiments, a subject, such as an animal, receives treatment and/or prevention and/or detection of C. difficile. In other embodiments, the animal is a human. For example, the polypeptides of the present invention may be used to raise antibodies to C. difficile in vivo. By way of further example, the polypeptides of the present invention may be used to determine if a subject produces antibodies to C. difficile. In some embodiments, the polypeptide is used to isolate antibodies. By way of example, polypeptides may be bound to an affinity matrix.

By way of further example, the nucleic acid of the present invention can be used to transform and/transfect cells to recombinantly produce the polypeptides and/or antibodies of the present invention. The nucleic acids of the present invention may also be used, for example, to determine if a subject is infected with C. difficile. By way of example, this can be achieved using methods of radiolabeled hybridization.

By way of further example, the antibodies of the present invention can be used to recognize an infection by C. difficile. By way of example, the antibodies can recognize native toxin A and/or toxin B as an antigen. The antibodies of the present invention can also be used to fight an infection by C. difficile. By way of example, humanized antibodies or antibody fragments or monoclonal antibodies can employ a subject's own immune response to a C. difficile infection. By way of further example, the antibodies of the present invention may be coupled to a cytokine or a toxin or an enzyme or a marker to assist in treating and detecting an infection.

Further aspects of the present invention relate to diagnostic assays. The present invention is of use with many assays known in the art. Those skilled in the art will recognize the wide array of research based uses for the polypeptides, nucleic acids and antibodies of the present invention. The polypeptides, antibodies and nucleic acids of the present invention may, for example, be labeled, such as with a radioactive, chemilu minescent, fluorescent and/or dye molecules. The antibodies, nucleic acids and polypeptides of the present invention lend themselves to use assays for example DNA assays (such as southern blotting), RNA assays (such as northern blotting), protein assays (such as western blotting), chromatographic assays (such as gas, liquid, HPLC, size-exclusion), immunoassays (such as ELISA) and structural assays (such as crystallography and NMR spectroscopy). The antibodies, polypeptides and nucleic acids of the present invention may further be used as probes. Assays which amplify the signals from a probe are also known to those skilled in the art.

Kits

The present invention provides kits comprising by way of example, and not limitation, nucleic acids encoding the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide, and/or antibodies against the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide. The kits may include one or more containers and instructions for use in accordance with any of the methods of the invention described herein. The C-TAB.G5 or C-TAB.G5.1 isolated polypeptide and/or antibodies of the invention may be used in a variety of assays including immunoassays for detecting C. difficile. In one embodiment, the C-TAB.G5 or C-TAB.G5.1 isolated polypeptide serves to function as an antigen for the purposes of detecting antibody in biological samples. The containers may be unit doses, bulk packages (e.g., multi-dose packages) or sub-unit doses. The kits of this invention are in suitable packaging. Also contemplated are packages for use in combination with a specific device, such as an inhaler, nasal administration device or an infusion device. A kit may have a sterile access port. The container may also have a sterile access port. Kits may optionally provide additional components such as buffers and interpretive information.

The kits may be used to detect the presence of C. difficile or to detect a disease associated with C. difficile, such as CDAD. The kits may be used to prevent or treat diseases associated with C. difficile. The kits of the present invention may also be used to alleviate the symptoms of a disease associated with C. difficile.

Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and utilize the claimed invention. The following working examples therefore, specifically point out preferred embodiments of the present invention, and are not to be construed as limiting in any way the remainder of the disclosure. All articles, publications, patents and documents referred to throughout this application are hereby incorporated by reference in their entirety.

EXAMPLES Example 1: Preparation of the C-TAB.G5 and C-TAB.G5.1 Isolated Polypeptides

This Example describes the preparation of isolated polypeptides comprising portions of the C. difficile toxins A (CTA) and B (CTB) for expression in E. coli cells. The method described below can be used for making various isolated polypeptides comprising CTA and CTB. As an example, an isolated polypeptide comprising a portion of the C-terminal domain of CTA and a portion of the C-terminal domain of CTB is described.

Example 1.1: Cloning of the C-TAB.G5 and C-TAB.G5.1 Gene Constructs

The portion of CTA gene (Accession No. YP-001087137) encoding amino acids 2026 to 2710 of the C-terminal domain was amplified by PCR from genomic DNA of C. difficile strain 630 (ATCC BAA-1382) using the following primers:

(SEQ ID NO: 9) forward: 5′- caccACTAGTatgaacttagtaactggatggc -3′ and (SEQ ID NO: 10) reverse: 5′- CTCGAGttagccatatatcccaggggc -3′. Amplification with the forward primer created a SpeI site, and amplification with the reverse primer created of a Xhol site.

The portion of CTB gene (Accession No: YP-00108735) encoding amino acids 1850 to 2366 of the C-terminal domain was amplified by PCR using the following primers:

(SEQ ID NO: 11) forward: 5′- caccATGCATatgagtttagttaatagaaaacag -3′ and (SEQ ID NO: 12) reverse: 5′- ggcCTCGAGctattcactaatcactaattgagc -3′. Amplification with the forward primer created a Nsil site, and amplification with the reverse primer created a Xhol site.

PCR reactions were performed using PCR Super-Mix (Invitrogen). The cycle conditions was 95° C. for 2 minutes, 95° C. for 45 seconds, 55° C. for 50 seconds, 68° C. for 8 minutes (30 cycles), and 72° C. for 10 minutes. The PCR products were purified with Quick gene extraction kit (Invitrogen) and ligated into the PCR 2.1 TOPO vector (Invitrogen). The ligation mixtures were used to transform E. coli Mech-1 cells by heat shock. The transformants were plated on plates of ImMedia Amp Blue (Invitrogen). White colonies were picked and cultured in 15 ml tubes with 4 ml of LB medium containing 100 μg/ml ampicillin. Cultures were incubated overnight at 37° C. and plasmids were extracted with Quick plasmid miniprep kit (Invitrogen).

The CTA gene fragment in the PCR 2.1-TOPO/TA vector was digested with SpeI and Xhol, and the fragment was cloned into an intermediate vector, also digested with SpeI and Xhol, using T4 DNA Ligase. A linker containing three restriction sites (BgLII-NsiI-SacI) was then inserted at the 3′ end of the CTA gene fragment by PCR using the following set of synthetic primers:

(SEQ ID NO: 13) forward: 5′- AGATCTATGCATGAGCTCctcgagcccaaaacgaaaggctcagc - 3′ (SEQ ID NO: 14) reverse: 5′- cggtccggggccatatatcccaggggcttttactcc -3′.

The CTB gene fragment in PCR 2.1-TOPO/TB was digested with Nsil and Xhol, and the digested CTB gene fragment was ligated to the intermediate vector containing the CTA gene and linker, which was also digested with Nsil and Xhol. The CTB gene was inserted 3′ to the linker giving the construct sequence 5′-CTA-linker-CTB-3′. This fusion construct is referred to as C-TAB.V1 intermediate vector.

The C-TAB.G5 gene was amplified by PCR from C-TAB.V1 intermediate vector using the primers:

(SEQ ID NO: 15) forward: 5′- caccCCATTGatggtaacaggagtatttaaagga (SEQ ID NO: 16) reverse: 5′ - CTCGAGctattcactaatcactaattgagctg. PCR reactions were performed using PCR Super mix (Invitrogen). The cycle condition was 95° C. for 2 minutes, 95° C. for 45 seconds, 55° C. for 50 seconds, 68° C. for 4 minutes (30 cycles) and 72° C. for 10 minutes. The PCR products were purified with Quick gene extraction kit (Invitrogen) and ligated into the PCR2.1-TOPO vector (Invitrogen). The ligation mixtures were used to transform E. coli Mech-1 cells by heat shock. The transformants were plated on plates of ImMedia Amp Blue (Invitrogen). White colonies were picked and cultured in 15 ml tubes with 4 ml of LB medium containing 100 μg/ml ampicillin. Cultures were incubated overnight at 37° C. and plasmids were extracted with Quick plasmid mini-prep kit (Invitrogen). The C-TAB.G5 fusion gene in the PCR 2.1-TOPOTA vector was digested with Ncol and Xhol restriction enzyme. These C-TAB fragments were ligated into the pET28 expression vector digested with the same restriction enzymes. This resulting construct encodes the toxin A C-terminal domain from amino acids 2272 to 2710 fused to toxin B C-terminal domain from amino acids 1851 to 2366. The pET28/C-TAB.G5 construct was transformed into E. coli BL21 (DE3) for expression. Five colonies containing the C-TAB.G5 fusion gene were selected for analysis.

The C-TAB.G5.1 coding sequence was obtained by codon optimization for improved expression within an E. coli host cells. The codon usage was adapted to the codon bias of E. coli genes. In addition, GC content was adjusted to prolong mRNA half life; a region of very high (>80%) or very low (<30%) GC content have been avoided. Therefore, the optimized gene allows high and stable expression rates in E. coli. The codon optimized C-TAB.G5.1 gene was synthesized in situ and subcloned into the expression vector pET-28b(+).

DNA Sequencing: Plasmid DNA sequences were confirmed using dye terminator cycle sequencing chemistry with d-Rhodamine dyes. Sequencing data were analyzed using Jellyfish software.

Example 1.2: Expression of the Recombinant C-TAB.G5 or C-TAB.G5.1 Fusion Proteins in E. coli

Expression of C-TAB.G5 and C-TAB.G5.1 gene constructs may be done using standard procedure for expression in E. coli.

Screening colonies for expression of the recombinant C-TAB fusion protein: For the purpose of screening, colonies were picked and grown in 15 ml Falcon tubes with 4 ml of LB media with 50 μg/ml kanamycin. The tubes were cultured overnight at 37° C. with mixing at 250 rpm. Following initial growth phase, 1 ml of culture from each tube was transferred to a 24-well tissue culture plate and expression was induced with 1 mM isopropyl-β-D-1-thiogalacto-puranoside (IPTG) for 3 h at 30° C. The cell pellets were collected by centrifugation at 12,000 g for 1 min in microcentrifuge. Cell pellet lysates were prepared, and the soluble fraction was assayed by SDS-PAGE and Western Blot analysis for expression of C-TAB fusion protein. Positive clones were selected for further evaluation.

Batch fermentation for C-TAB.G5 expression: Seed cultures were grown in five 500 ml shake flasks each containing 150 ml Super Broth medium supplemented with 30 μg/ml kanamycin. Cultures were grown for 12 h at 28° C. with continuous agitation at 275 rpm until OD₆₀₀ reached 2-2.5. The shake flasks were used to inoculate a fermenter containing 10 L Super Broth. The culture was grown approximately 4.5 h at 37° C. to OD₆₀₀=3.5-4. For induction of the product expression 0.1 mM IPTG was added and growth continued for additional 4 h at 25° C. Then the cells were harvested by centrifugation and the cell paste stored frozen at −70° C. A typical product specific expression rate achieved by this fermentation process was about 200 mg/ml.

Fed-batch fermentation for C-TAB.G5.1 preparation: An aliquot of 500 μl of the glycerol stock of a seed bank (stored at −75° C.) was used to inoculate 100 ml pre-culture medium supplemented with 30 μg/ml kanamycin in a 1 L shake flask. The pre-culture was incubated at 37° C. under constant agitation at ˜150 rpm for approximately 7 h until it reached OD₆₀₀=1.0-2.0. 25 mL of pre-culture was used to inoculate 7 L batch fermentation medium in a standard industry 15 L fermenter equipped with process control system, able to perform fed-batch fermentations. 7 L batch culture phase was carried for 12 h at 37° C. (OD₆₀₀=12-15) until glucose was exhausted. Glucose feed phase (biomass production) was then initiated by an exponential feed mode at a specific growth rate constant μ=0.25/h at 37° C. for 6 h (OD₆₀₀=40-50). One hour before switching to a constant feed phase and induction with a final concentration of 1 mM IPTG (product production), temperature was reduced to 30° C. to lower the risk of inclusion body formation. Product expression phase was continued for another 5 h with constant feed at 30° C. (OD₆₀₀=˜100), resulting in a total fermentation process time of 23 h and a final culture volume of ˜8.2 L. A wet cell biomass of about 1.2 kg was harvested by centrifugation and stored at ≤−70° C. A typical product specific expression rate reached by such fed-batch fermentation was up to 1.3 g/L.

Example 1.3: Purification of the Recombinant C-TAB.G5 or C-TAB.G5.1 Fusion Proteins

Purification of C-TAB.G5 analytical sample: Frozen cell paste was thawed and resuspended in 10 mM citric acid/NaOH buffer at pH 5.6, and the cell slurry was passed two times through a homogenizer (GEA Niro Soavi homogenizer) at 550 bar. The suspension was centrifuged two times: once at 13500 rpm for 30 minutes and the second time at 18000 rpm in an ultracentrifuge for one hour. The supernatants were pooled, and the pH adjusted to 5.6 with 50 mM citric acid buffer pH 3. Clarified cell lysates were passed over a SP fast flow column with 10 mM citric acid/NaOH buffer at pH 5.6. Proteins were eluted with a liner gradient of sodium chloride increasing from 0 to 500 mM in 20 mM NaPi. Fractions containing the C-TAB.G5 were pooled. The conductivity was adjusted down to 5 mS/cm with distilled H₂O. Tris was added to a 25 mM final concentration. The pooled fractions were passed over a DEAE fast flow column. Protein was eluted with a linear gradient of sodium chloride increasing from 50 to 500 mM in 25 mM Tris. Again, fractions containing C-TAB.G5 were pooled and 1.5 M Na-Citrate, pH 7.5 was added to a final concentration of 0.4 M. The C-TAB.V1 pool was loaded onto a phenol Sepharose HP column equilibrated with 25 mM Tris, 0.4 M Na-Citrate pH 7.5. C-TAB.G5 fusion protein was eluted with a reducing salt concentration in a liner gradient using 5 mM Tris, pH 7.5. All columns were monitored by an AKTA Prime chromatography system. Purified C-TAB fusion protein was buffer exchanged to PBS using a 50 K membrane.

Purification of C-TAB.G5.1 bulk preparation: Biomass was stored at −80° C. until processing. 450 g frozen cell paste (equivalent to 2.90 L fermenter) is diluted with 4 volumes of lysis buffer (20 mM Hepes, pH 7.5, ˜0.6 mS/cm) (e.g. 450 g paste+1800 mL buffer) and thawed by this way for ˜1 h±0.5 h under mechanical agitation. Optional, remaining clumps can be resuspended using an Ultraturrax (e.g. 5 min at 8000 rpm). Cell lysis is done on a Niro Soavi Panda high homogenizer (640±25 bar, 3 cycles). The lysate is cooled down to <10° C. using a heat exchanger and kept at this temperature until centrifugation. The crude cell lysate is submitted to a batch centrifugation step (Beckmann Avanti JLA 60.25) operated at 14000 rpm (30000 g) at 4° C. for 30 min. The supernatants are collected and pooled. The semi-liquid part of the pellet is discarded too, to decrease the risk of clogging the filtration step. The pooled supernatants are then filtered through a Supercap PDH4 100/5 inch depth filter capsule (Pall) (250 cm² effective filtration area). The remaining lysate in the filter housing is flushed out with lysis buffer. After clarification, an aliquot of 1M Tris stock solution, pH 7.5 is added to the lysate to a final concentration of 25 mM. The buffer composition of final lysate is 20 mM Hepes, 25 mM Tris, pH 7.5, conductivity ˜6 mS/cm). The lysate might be still slightly turbid after filtration, but this does not affect the following capture step. Capture step is performed at room temperature with DEAE Sepharose FF (GE Healthcare) in a XK50/30 column (GE Healthcare) of following dimensions: diameter 50 mm, packed bed height 20 cm, packed bed volume ˜400 mL. The loading density is approx. 0.8 to 1.2 g biomass/mL gel. The process is run by an Äkta Explorer system (GE Healthcare) and monitored at 280 nm. Equilibration is performed at 100 cm/h with approx. 5 CV of 25 mM Tris, 20 mM Hepes, 25 mM NaCl, pH 7.5, conductivity ˜5 mS/cm until pH, conductivity and 280 nm absorbance are stable. The lysate is loaded onto the column at 75 cm/h and the flow through is discarded. When all filtrated lysate is loaded, flow is resumed with approx. 5 CV of equilibration buffer until the 280 nm absorbance is stabilized. Impurities are removed from the column during wash step 2 with 5 CV of 25 mM Tris, 175 mM NaCl, pH 7.5, conductivity 19 mS/cm. The C-TAB protein is eluted from the column by step elution with 3 CV of 25 mM Tris, 375 mM NaCl, pH 7.5, conductivity 36 mS/cm. The collection of the C-TAB containing fractions begins when 280 nm absorbance starts to increase (usually after 1 CV) and lasts for about 0.5 to 1.0 CV. The pooled fractions containing C-TAB can be stored at 2-8° C. over night. Intermediate purification step is done with SP-Sepharose FF (GE Healthcare) in a XK50/30 column (GE Healthcare) at room temperature with the following dimensions: diameter 50 mm, packed bed height 20 cm, packed bed volume ˜400 mL. The maximum loading density is approx. 4-5 mg C-TAB/mL gel. The process is run by an Akta Explorer system (GE Healthcare) and monitored at 280 nm. Equilibration, washing and linear gradient elution steps are performed at a maximum flow rate of 200 cm/h (65 mL/min) unless exceeding back pressure (>4 bar) prevents it. Equilibration is performed with approx. 5-10 CV of buffer G at 200 cm/h until pH, conductivity and 280 nm absorbance are stable. Before loading, the DEAE pool has to be adjusted to allow binding of C-TAB on SP-FF resin. DEAE pool is diluted 25 fold with SP-FF equilibration buffer (10 mM citric acid, 2 mM EDTA, pH 5.5±0.1, conductivity ˜2 mS/cm) to a final conductivity of not more than 3.5 mS/cm, pH 5.5±0.1. If necessary additional MilliQ water is added to achieve the desired conductivity. Note that low conductivity is very critical to allow binding of C-TAB onto SP-FF. The sample is loaded onto the column at 150 cm/h and the flow through is discarded. After loading the sample, flow is resumed with approx. 5 CV of equilibration buffer at 200 cm/h until the 280 nm absorbance is stabilized. Elution is done by linear gradient at 100 cm/h from 0% equilibration buffer to 30% 20 mM sodium phosphate, 500 mM NaCl, pH 7.0 over 10 CV. Fractions are collected and pooling is performed by UV 280 nm absorbance. Pooling starts at 15% of peak maximum and ends at 15% of peak maximum. The pool is immediately adjusted to 400 mM citrate (final pH 7, approx. 49 mS/cm) using a 1.5 M citrate stock solution, pH 8.0. The adjusted SPFF pool should have pH 7 and approx. 49 mS/cm and is stored at 2-8° C. over night.

Polishing chromatography step is performed with Phenyl-Sepharose HP (GE Healthcare) in a XK50/30 column (GE Healthcare) at room temperature with the following dimensions: diameter 50 mm, packed bed height 15 cm, packed bed volume ˜300 mL. The loading density is approx. 4-5 mg C-TAB/mL gel. The process is run by an Äkta Explorer system (GE Healthcare) and monitored at 280 nm. Equilibration, loading, washing and elution steps are performed at a maximum flow rate of 100 cm/h (33 mL/min) unless exceeding back pressure (>4 bar) prevents it. In such a case the flow rate has to be reduced. Equilibration is performed with approx. 5-10 CV of 25 mM Tris, 400 mM sodium citrate, pH 7.5, 46 mS/cm at 100 cm/h until pH, conductivity and 280 nm absorbance are stable. The sample is loaded onto the column at 100 cm/h and the flow through is discarded. After loading the sample, flow is resumed with approx. 5 CV of equilibration buffer at 100 cm/h until the 280 nm absorbance is stabilized. Elution is done by linear gradient at 100 cm/h from 100% equilibration buffer/0% 5 mM Tris, pH 7.5, 0.5 mS/cm to 100% 5 mM Tris, pH 7.5, 0.5 mS/cm over 20 CV. Fractions are collected and pooling is performed by UV280 nm absorbance. Pooling starts at approx. 10-15% of peak maximum and ends at approx. 20% of peak maximum. The adjusted pool is stored at 2-8° C. over night. Preparation of final C-TAB drug substance protein solution is achieved by 30 kDa cut-off tangential flow filtration (TFF, Pellicon 2 membrane, Millipore) operated at room temperature. The protein solution is diafiltered against formulation buffer (20 mM Histidine, 75 mM NaCl, 5% Sucrose, 0.025% Tween®80, pH 6.5) until the permeate pH equals 6.5±0.2).

Final protein concentration is adjusted to 2 mg/mL according to UV measurement at 280 nm using 1.566 as the specific extinction coefficient at 280 nm for C-TAB (protein conc. 1 mg/mL, 1 cm cuvette).

SDS-PAGE and Western Blot Analysis: Whole cell lysates and purified C-TAB.G5 or C-TAB.G5.1 fusion protein were resuspended in Nu-Page sample buffer containing beta-mercaptoethanol and boiled for 10 min. Samples (25 μl) were loaded onto 3-8% Tris-Acetate gel. Following electrophoresis (150 V for 1 h), proteins were visualized by staining the gels with simply blue stain or used for Western blot analysis.

C-TAB.G5 or C-TAB.G5.1 specific expression was determined by Western blot analysis using toxin-specific antibodies. Proteins were transferred at 23 V for 60 min onto a PVDF membrane using 1× Transfer buffer in 10% methanol. Membranes were blocked for 1 h at room temperature with 0.5% casein in phosphate buffered saline (PBS). Transfer membranes were incubated for 2 hrs at room temperature with either a monoclonal antibody against Toxin B (GenWay; clone B426M) or an in-house derived Guinea Pig polyclonal antibody against Toxin A (List Biological Labs). Washed membranes were incubated with horseradish peroxidase conjugated anti-guinea pig IgG or anti-mouse IgG. The blots were washed and AEC substrates were added. The blots were incubated with gentle mixing for 5-10 minutes. The blots were rinsed with water to stop color development.

RBC hemagglutination: The cell binding domain of toxin A but not toxin B has been shown to be capable of agglutinating rabbit red blood cells (RBCs). The agglutination process is the result of the binding of toxin A to a glycan sequence found on blood antigens on rabbit RBCs. Samples (C-TAB.G5 and native toxin A) are diluted to 100 μg/ml in PBS. In a V-bottom microtiter plate, two-fold serial dilutions are prepared in duplicate across the plate, starting at 100 μg/ml and leaving 50 μl of the dilution in each well. Fifty microliters of a 0.75% rabbit RBC/PBS suspension is added to each well of the microtiter plate and the plate is incubated for 1 h at room temperature. Hemagglutination is indicated by the failure to form a pellet of RBCs on the bottom of the plate. The hemagglutination titer of a sample is represented by the concentration of protein present in the well with the highest sample dilution in which no RBC pellet is observed.

Example 2: Dose Titration of the Recombinant C-TAB.G5 Fusion Protein in the Presence and Absence of Alum in Mice

This study was to determine the feasibility of an in vivo dose titration of C-TAB.G5 with and without alum adjuvant as a C-TAB potency assay. The alum utilized was Alydragel, (alum hydroxide, Brenntag). C57BL/6 female mice (Charles River Labs.), aged between 8 and 9 weeks, were utilized for immunization. All animals received a first immunization by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second immunization was done by IM injection into the left thigh muscle on day 14. A total of 72 mice were divided into 12 groups vaccinated as follows:

-   -   Group 1: PBS only     -   Group 2: 100 (154) ng C-TAB.G5     -   Group 3: 300 (462) ng C-TAB.G5     -   Group 4: 1,000 (1,540) ng C-TAB.G5     -   Group 5: 3,000 (4,620) ng C-TAB.G5     -   Group 6: 10,000 (15,400) ng C-TAB.G5     -   Group 7: PBS with 50 μg alum     -   Group 8: 10.0 (15.4) ng C-TAB.G5 with 50 μg alum OH     -   Group 9: 30.0 (46.2) ng C-TAB.G5 with 50 μg alum OH     -   Group 10: 100 (154) ng C-TAB.G5 with 50 μg alum OH     -   Group 11: 300 (462) ng C-TAB.G5 with 50 μg alum OH     -   Group 12: 1,000 (1,540) ng C-TAB.G5 with 50 μg alum OH

In this study the protein concentration was firstly determined according to the standard protocol Quick Start™ Bradford Protein Assay (Bio-Rad). Lately, the protein concentration (shown in parentheses) was re-determined by UV measurement at 280 nm according to the procedure described in Example 1.3. In all follow-up studies the protein concentration was measured by UV method.

Blood samples were collected from all animals two weeks after the first immunization (study day 14) and two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed.

Serum IgG ELISA: Serum antibodies elicited to C-TAB.G5 or C-TAB.G5.1 (referred as C-TAB), toxin A and toxin B or toxoids thereof were evaluated in an enzyme linked immunosorbent assay (ELISA). Briefly, stock solutions of 1.0 μg/ml of toxin A, toxin B or the C-TAB.G5 isolated polypeptide were prepared in PBS and 100 μl were added to each well of a 96-well plates. After overnight incubation at 4° C., the plates were washed and blocked with 0.5% casein blocking buffer. Plates were washed again and serial, two-fold dilutions of test sera added to the plates. After a second overnight incubation at 4° C., plates were washed and incubated with peroxidase-conjugated anti-mouse IgG (H+L). After a 2 hours incubation at room temperature, the plates were again washed, peroxidase substrate (2,2′-azinobis(3-ethylbenzthiazoline-6-sulfonate) added and color allowed to develop for 2 h at room temperature. The reaction was stopped by adding 50 μl of 2% SDS to the wells. Plates are read with an ELISA plate reader at an absorbance of 405 nm. Serum antibody titers are reported as the geometric mean of ELISA Units, which are the serum dilutions that results in an OD 405 nm reading of 1.0. As a negative control a pool sample of pre-immune serum obtained from animals pre-bled before the first immunization was used to evaluate an antibody response.

Animals receiving C-TAB.G5 demonstrated a dose dependent increase in antibody titers, with the alum adjuvant allowing for significantly improved antibody titers at a lower dose of C-TAB.G5. FIG. 3 shows the titers for anti C-TAB, anti-toxin A and anti-toxin B IgG. FIG. 4 shows a graphical comparison of antibody titers in the presence or absence of alum.

Example 3: Immunogenicity and Protective Efficacy of C-TAB.G5 in Mice

This study was to evaluate the immunogenicity and protective efficacy of C-TAB.G5 in vaccinated mice receiving a lethal challenge of C. difficile toxin A or toxin B. Female C57BL/6 mice (Charles River Labs.), aged 6-7 weeks, were utilized for this study. All animals received the first vaccination by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second vaccination was done by IM injection into the left thigh muscle on day 14. 116 mice were divided into groups vaccinated as follows:

-   -   Group 1: PBS only     -   Group 2: 3 μg C-TAB.G5     -   Group 3: 10 μg C-TAB.G5     -   Group 4: 30 μg C-TAB.G5     -   Group 5: 3 μg C-TAB.G5+50 μg alum OH     -   Group 6: 10 μg C-TAB.G5+50 μg alum OH     -   Group 7: 30 μg C-TAB.G5+50 μg alum OH     -   Group 8:: PBS only     -   Group 9: 3 μg C-TAB.G5     -   Group 10: 10 μg C-TAB.G5     -   Group 11: 30 μg C-TAB.G5     -   Group 12: 3 μg C-TAB.G5+50 μg alum OH     -   Group 13: 10 μg C-TAB.G5+50 μg alum OH     -   Group 14: 30 μg C-TAB.G5+50 μg alum OH

Blood samples were collected from all animals two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were then determined by ELISA and reported as ELISA Units (EU).

FIG. 5 shows serum antibody titers to C-TAB, toxin A and toxin B in mice evaluated two weeks after the second immunization (study day 28). This study demonstrated that the C-TAB.G5 fusion protein is highly immunogenic in mice and is able to induce strong antibody response against both toxin A and toxin B even without adding an adjuvant. The C-TAB.G5 immunogenicity can be significantly augmented (more than a one log) by co-delivery with alum hydroxide. The animals receiving C-TAB.G5 with or without alum demonstrated 2-fold increased antibody response over a one log dose range.

Besides evaluating antibody titers, the antibodies generated by immunization with C-TAB.G5 were assessed for their ability to neutralize native toxin A and B in in vitro toxin neutralization assay (TNA).

Toxin Neutralizing Antibody Assay (TNA). For in vitro analysis, 125 μl of either toxin A (5 ng/ml) or toxin B (1 ng/ml) was incubated with 125 μl of serial dilutions of anti-sera obtained from immunized mice. After one hr of incubation at 37° C., the toxin:serum mixture was added to microtiter wells containing Vero cells (monkey kidney cells), and the microtiter plates incubated for 18 hr. Incubation of either toxin A or B with Vero cells resulted in a change in cell morphology and a loss of cell adherence which was measured by neutral red staining of toxin treated cells after removal of non-adherent cells. The toxin neutralization titer of a serum is reported as the serum dilution which gives a 50% reduction in toxin activity.

The results of the TNA assay are shown in FIG. 6. The data indicate that antibodies generated following immunization with the C-TAB.G5 alone are capable of neutralizing the toxic activity of native toxin A but not toxin B. When the C-TAB.G5 was co-delivered with alum, TNA titers were augmented with approximately 6-fold increase in anti-toxin A TNA and only 2-fold lower titers in anti-toxin B TNA. This data indicates that the C-TAB.G5 isolated polypeptide not only retains the antibody recognition antigenic epitopes present in the native toxins, but comprises critical antigenic epitopes required for the generation of functional toxin neutralizing antibody. Thus, C-TAB.G5 is effective in neutralizing toxic effects of C. difficile toxin A and toxin B and, therefore, is useful in vaccination.

In addition to assessing antibody response, the ability of C-TAB.G5 immunization to protect mice from a lethal challenge of native toxins was determined. Three weeks after the second vaccination (study day 35) animals in vaccinated and non-vaccinated groups (N=8) received intraperitoneally (IP) a lethal dose of either 25 ng of toxin A or 50 ng of toxin B. Survival of the mice was monitored over the following 9 days and the results are shown in FIG. 6. This experiment demonstrated that immunization of mice with C-TAB.G5 in the absence of the alum adjuvant was capable of conferring 100% protection against a lethal challenge with native toxin A and 50% protection against toxin B challenge. Co-delivery of C-TAB.G5 with Alum enhanced the protective immunity to toxin B up to 100% protection. This data indicates that C-TAB.G5 vaccination induces an immune response sufficient to protect mice from the toxic effects of both toxin A and B in the lethal challenge model.

Example 4: Evaluation of the Immunogenicity and Protective Efficacy of C-TAB.G5 in Young and Aged Mice

This study was to compare the immune response mounted against C-TAB.G5 in young and aged mice. Female C57BL/6 mice (Charles River Labs.), aged 6-7 weeks and 18 months, respectively, were utilized for this study. All animals received the first vaccination by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second vaccination was done by IM injection into the left thigh muscle on day 14. 192 mice were divided into groups vaccinated as follows:

-   -   Group 1: PBS to young mice     -   Group 2: PBS to aged mice     -   Group 3: 10 μg C-TAB.G5 to young mice     -   Group 4: 30 μg C-TAB.G5 to young mice     -   Group 5: 10 μg C-TAB.G5 to aged mice     -   Group 6: 30 μg C-TAB.G5 to aged mice     -   Group 7: 10 μg C-TAB.G5+50 μg alum OH to young mice     -   Group 8: 30 μg C-TAB.G5+50 μg alum OH to young mice     -   Group 9: 10 μg C-TAB.G5+50 μg alum OH to aged mice     -   Group 10: 30 μg C-TAB.G5+50 μg alum OH to aged mice     -   Group 11: PBS to young mice     -   Group 12: PBS to aged mice     -   Group 13: 10 μg C-TAB.G5 to young mice     -   Group 14: 30 μg C-TAB.G5 to young mice     -   Group 15: 10 μg C-TAB.G5 to aged mice     -   Group 16: 30 μg C-TAB 5^(th) to aged mice     -   Group 17: 10 μg C-TAB.G5+50 μg alum OH to young mice     -   Group 18: 30 μg C-TAB.G5+50 μg alum OH to young mice     -   Group 19: 10 μg C-TAB.G5+50 μg alum OH to aged mice     -   Group 20: 30 μg C-TAB.G5+50 μg alum OH to aged mice

Three weeks after the second vaccination (study day 35) animals in vaccinated and non-vaccinated groups (N=6) received a lethal challenge by intraperitoneal (IP) injection with 25 ng toxin A or 50 ng toxin B. Survival of the mice was monitored over the following 9 days.

Blood samples were collected from all animals two weeks after the first immunization (study day 14) and two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were then determined by and reported as ELISA Units (EU). Toxin A and toxin B neutralizing antibodies (TNA) were determined using Vero cells treated with a cytotoxic amount of recombinant toxin A and toxin B.

Young animals receiving the C-TAB.G5 vaccine demonstrated significantly higher levels of all antibodies tested, as compare to old animals. Especially high antibody titers were obtained in young mice vaccinated with C-TAB.G5 in the presence of alum hydroxide (FIG. 7). Particularly significant improvement was achieved in toxin B TNA titer. At the same time, there was no big difference between young and aged mice in ability to withstand the toxin A and toxin B challenges. However, both groups demonstrated improved protection rate when vaccinated in the presence of alum. FIG. 7 shows a comparison of C-TAB.G5 immunogenicity and protective efficacy in young vs. old mice. FIG. 8 shows the kinetics of anti-C-TAB antibody development in young and old mice.

Example 5: Comparison of the Immunogenicity and Protective Efficacy of C-TAB.G5.1 and Toxoid A and B

This study was to compare the immunogenicity and protective efficacy of C-TAB.G5.1, vs. toxoid A/B. The toxoid A/B used was the mixture of equal parts (1:1) of toxoid A (lot #1009132) and toxoid B (lot #1009133). Toxoid was prepared by formalin fixation and provided by TechLab. Female C57BL/6 mice (Charles River Labs.), aged 6-7 weeks, were utilized for this study. All animals received the first vaccination by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second vaccination was done by IM injection into the left thigh muscle on day 14. 180 mice were divided into groups vaccinated as follows:

-   -   Group 1: PBS only     -   Group 2: 10 μg C-TAB.G5.1     -   Group 3: 30 μg C-TAB.G5.1     -   Group 4: 10 μg C-TAB.G5.1+50 μg alum OH     -   Group 5: 10 μg C-TAB.G5.1+50 μg alum OH     -   Group 6: 30 μg toxoid A/B     -   Group 7: 10 μg toxoid A/B     -   Group 8: 30 μg toxoid A/B+50 μg alum OH     -   Group 9: 30 μg toxoid A/B+50 μg alum OH     -   Group 10: PBS     -   Group 11: 10 μg C-TAB.G5.1     -   Group 12: 30 μg C-TAB.G5.1     -   Group 13: 10 μg C-TAB.G5.1+50 μg alum OH     -   Group 14: 30 μg C-TAB.G5.1+50 μg alum OH     -   Group 15: 10 μg toxoid A/B     -   Group 16: 30 μg toxoid A/B     -   Group 17: 10 μg toxoid A/B+50 μg alum OH     -   Group 18: 30 μg toxoid A/B+50 μg alum OH

Three weeks after the second vaccination (study day 35) animals in vaccinated and non-vaccinated groups (N=6) received a lethal challenge by intraperitoneal (IP) injection with 28 ng toxin A or 50 ng of toxin B. Survival of the mice was monitored over the following 9 days.

Blood samples were collected from all animals two weeks after the first immunization (study day 14) and two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were then determined by ELISA and reported as ELISA Units (EU). Toxin A and toxin B neutralizing antibodies (TNA) were determined using Vero cells treated with a cytotoxic amount of recombinant toxin A and toxin B.

This study demonstrates immunogenicity and protective efficacy of C-TAB.G5.1 and toxoid A/B in mice after two vaccinations. Animals receiving C-TAB.G5.1 showed lower but significant anti-C-TAB antibody titers, as compare to animals receiving toxoid A/B. Also, co-delivery of alum greatly augmented all tested antibody responses. As a result, the level of anti-C-TAB and anti-toxin A antibodies achieved in animals immunized either with C-TAB.G5.1 or with toxoid A/B in the presence of alum are similar. The only lower antibody titer was observed for anti-toxin B antibody when mice were immunized with C-TAB.G5.1, as compare to mice immunized with toxoid A/B. Noteworthy, unlike the antibodies generated against C-TAB.G5.1 recognizing epitopes in the C-terminal portion of the toxin molecules, antibodies induced with toxoid immunization were specific to the N-terminal portion of the toxin molecules, which was read out in the anti-toxin ELISA. Thus, anti-toxin A and anti-toxin B antibodies generated in mice immunized with C-TAB.G5.1 and toxoid A/B were antibodies of different specificity and, therefore, can not be compared directly. However, the data indicates that antibody response to C-TAB.G5.1 immunization is significantly high, like in case of immunization with toxoids. In addition, the toxin challenge study demonstrated that ability of C-TAB.G5.1 immunization to protect mice against from a lethal challenge is comparable to protection efficacy of toxoid A and B. FIG. 9 shows a comparison of the immunogenicity of C-TAB.G5.1 and toxoid A/B. FIG. 10 shows the toxin neutralization and protection data for mice immunized with C-TAB.G5.1 as compared to those immunized with toxoid A/B.

Example 5.1: Comparison of Antibody Titers and Protective Efficacy of C-TAB.G5.1 in Different Immunization Regimens

This study was to compare the immunogenicity and protective efficacy of C-TAB.G5.1 in mice receiving three doses of the vaccine in different immunization regimens. Female C57BL/6 mice (Charles River Labs.), aged 6-7 weeks, were utilized for this study. 135 mice were divided into 14 groups. All animals in groups no. 2-13 received three vaccinations by intramuscular (IM) injection (50 μl) into the right thigh muscle or left thigh muscle in days indicated below. Mice in groups 1 and 8 did not received vaccination; they served as a negative control. The immunization was performed as follows:

Group Vaccine Immunization day (route) 1 — — 2 10 μg G5.1 0 (R), 3 (L), 14 (R) 3 10 μg G5.1 + 50 μg Alum 0 (R), 3 (L), 14 (R) 4 10 μg G5.1 0 (R), 7 (L), 21 (R) 5 10 μg G5.1 + 50 μg Alum 0 (R), 7 (L), 21 (R) 6 10 μg G5.1 0 (R), 14 (L), 28 (R) 7 10 μg G5.1 + 50 μg Alum 0 (R), 14 (L), 28 (R) 8 — — 9 10 μg G5.1 0 (R), 3 (L), 14 (R) 10 10 μg G5.1 + 50 μg Alum 0 (R), 3 (L), 14 (R) 11 10 μg G5.1 0 (R), 7 (L), 21 (R) 12 10 μg G5.1 + 50 μg Alum 0 (R), 7 (L), 21 (R) 13 10 μg G5.1 0 (R), 14 (L), 28 (R) 14 10 μg G5.1 + 50 μg Alum 0 (R), 14 (L), 28 (R) (R) = right thigh muscle (L) - left thigh muscle

Blood samples were collected from all animals on study day 0, 3, 7, 14, 21, 28, 35 and 42. The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were determined by ELISA and reported as ELISA Units (EU). Toxin A and toxin B neutralizing antibodies (TNA) were determined on study day 42 using Vero cells treated with a cytotoxic amount of recombinant toxin A and toxin B.

Three weeks after the last vaccination (study day 49) animals in vaccinated and non-vaccinated groups (N=8) received a lethal challenge by intraperitoneal (IP) injection with 28 ng toxin A or 50 ng toxin B. Survival of the mice was monitored over the following 9 days.

This study demonstrates that all antibody titers measured two weeks after the third vaccination or on study day 35 and 42 are comparable in all immunization regimens, although the immunization regimen of 0/14/28 shows the best antibody responses. If compare antibody titers measured two weeks after the second vaccination, then the immunization regimen of 0/14/28 is better than the immunization regimen of 0/7/21, and much better than the immunization regimen of 0/3/14. This study confirmed that anti-toxin A/B antibody titers are significantly enhanced when the antigen is co-injected with aluminum hydroxide (data not shown). The study also shows that even two doses of the vaccine with alum administered in two-week interval can elicit high antibody level, comparable to the level obtained after three dose vaccinations.

The level of toxin A/B neutralizing antibodies is much higher in the immunization regimen of 0/7/21 and 0/14/28 than in the immunization regimen of 0/3/14.

Complete protection against challenge with toxin A does not require alum in the immunization regimen of 0/7/21 and 0/14/28 but not in 0/3/14. The immunization regimen of 0/14/28/with alum induced a highest level of protection (87.5%) against toxin B challenge, while the immunization regimen of 0/7/21 provides 37.5% of protection and 0/3/14 shows 28.6% of protection. Results of this study are shown in FIGS. 20A and 20B.

Example 6: Evaluation of the Immunogenicity and Protective Efficacy of the Recombinant C-TAB.G5.1 Fusion Protein in Hamsters

This study was to further evaluate the immunogenicity of the recombinant fusion protein C-TAB.G5.1 administered with or without adjuvant in a different animal model.

Female hamsters (Harlan), aged over 7 weeks and weighing between 80 and 90 g were utilized for this study. All animals received the first vaccination by bolus (50 μl) intramuscular (IM) injection into the right thigh muscle on day 0. The second vaccination was by IM injection into the left thigh muscle on day 14 and the third vaccination was by IM injection on day 28. Hamsters were divided into groups (N=6) and vaccinated as follows:

-   -   Group 1: Formulation buffer only     -   Group 2: 10 μg C-TAB.G5.1     -   Group 3: 10 μg C-TAB.G5.1+100 μg alum OH     -   Group 4: 30 μg C-TAB.G5.1     -   Group 5: 30 μg C-TAB.G5.1+100 μg alum OH     -   Group 6: 100 μg C-TAB.G5.1     -   Group 7: 100 μg C-TAB.G5.1+100 μg alum OH     -   Group 10: Formulation buffer only     -   Group 11. 10 μg C-TAB.G5.1     -   Group 12. 10 μg C-TAB.G5.1+100 μg alum OH     -   Group 13. 30 μg C-TAB.G5.1     -   Group 14. 30 μg C-TAB.G5.1+100 μg alum OH     -   Group 15 100 μg C-TAB.G5.1     -   Group 16. 100 μg C-TAB.G5.1+100 μg alum OH

Two weeks after the third vaccination (study day 42) animals in vaccinated and non-vaccinated groups (N=6) received a lethal challenge by intraperitoneal (IP) injection with 75 ng toxin A or 125 ng toxin B. An extra 12 hamsters were used for a dose titration of toxin A or toxin B challenge on the day 44. Survival of the hamsters was monitored over the following 8 days.

Blood samples were collected from all animals two weeks after the first immunization (study day 14), after the second immunization (study day 28) and the third immunization (study day 35). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were then determined by ELISA and reported as ELISA Units (EU). Toxin A and toxin B neutralizing antibodies (TNA) were determined using Vero cells treated with a cytotoxic amount of recombinant toxin A and toxin B.

This study demonstrated that hamsters, similarly to mice, were able positively respond to the C-TAB.G5.1 vaccination. Animals receiving C-TAB.G5.1 demonstrated a dose dependent increase in all tested antibody titers, while the alum adjuvant significantly improved antibody titers at all doses of C-TAB.G5. The highest antibody titers were observed two weeks after the second shots (study day 28). FIGS. 11A-11C show antibody titers for each group of immunized hamsters. FIG. 12 shows the kinetics of anti-C-TAB antibody development in hamsters immunized with C-TAB.G5 in the presence or absence of alum hydroxide.

The results of the TNA assay are shown in FIG. 13. These results are similar to those obtained for mice and indicate that antibody generated against the C-TAB.G5.1 fusion protein in hamsters are effective in neutralizing toxic effects of C. difficile toxin A and toxin B.

FIG. 13 also shows protection data for hamsters immunized with C-TAB.G5.1 following a lethal toxin challenge. High protection was achieved even by vaccination with C-TAB.G5.1 in the absence of the adjuvant. The protection level was improved to 100% by adding alum to the vaccine.

Example 7: The Protective Efficacy of the C-TAB.G5.1 Fusion Protein Against a C. difficile Spore Challenge in Clindamycin-Treated Hamsters

Following antibiotic treatment C. difficile can colonize the gut and, if toxigenic, may cause an antibiotic associated diarrhea. C. difficile associated disease (CDAD) of humans is modeled in hamsters using clindamycin to make the animals susceptible to colonization, diarrhea and death, usually within a few days after seeding with a toxigenic strain. To assess the efficacy of the C-TAB.G5.1 vaccine, vaccinated and non-vaccinated hamsters were challenged with clindamycin and C. difficile strain 630. 100 μg of C-TAB.G5.1 was mixed with 125 μg alum-hydroxide adjuvant. Female adult hamsters weighing ˜100 g received 3 vaccinations by intramuscular (IM) injection on days 0, 14 and 28. The placebo was PBS. 48 hamsters were divided into groups of 8 as vaccinated as follows:

-   -   Group 1: PBS only+10² spore challenge     -   Group 2: C-TAB.G5.1+10² spore challenge     -   Group 3: PBS only+10³ spore challenge     -   Group 4: C-TAB.G5.1+10² spore challenge     -   Group 5: PBS only+10⁴ spore challenge     -   Group 6: C-TAB.G5.1+10⁴ spore challenge

On day 42 all animals in all groups received an oral dose of 10 mg clindamycin phosphate/kg body weight. On day 43 all animals in all groups were dosed by oral gavage with washed spores of C. difficile strain 630. Three levels of spore challenge were used (˜10², 10³ and 10⁴). Observation, but no treatment, continued until day 54. At study termination, all surviving animals were disease free for ≥5 days.

Blood samples were drawn to obtain serum for serological studies on day 0, 14, 28, 42 and day 54 (end of study). Feces were collected on days 1 and 42, directly from the anus of the hamsters, or if needed, from among the bedding.

Results are shown on FIG. 14 demonstrating survival curves after spore challenge in hamsters. Survival data was plotted as Kaplan-Meier survival fit curves and statistical analysis was done using a log rank analysis. At all spore doses, 100% survival of hamsters in the vaccinated group was observed and survival was significantly enhanced when compared to the placebo group: p=0.0245 at 10² spores, p=0.0006 at 10³ spores, p<0.0001 at 10⁴ spores.

Example 8: Immunogenicity and Protection Efficacy of C-TAB.G5.1 in Monkeys

This study was to evaluate the immunogenicity and protection of C-TAB.G5.1 in cynomolgus monkeys. Six female cynonomolgus monkeys, aged between 4 and 6 years and weighing between 2 and 4 kg, were used for this study. Two groups of three monkeys were arranged, the first group (Group 1) receiving 200 μg of C-TAB.G5.1 and the second (Group 2) receiving 200 μg of C-TAB.G5.1 and 250 μg alum. As alum adjuvant Rehydragel (Reheis, lot#534401, dilute in PBS to 2 mg/ml) was used. Before blood collection or immunization, animals were shaved (if necessary).

The 1^(st) (study day 0) and 3^(rd) (study day 28) immunizations were injected on the left arm (deltoid), the 2^(nd) immunization (study day 14) was injected to the right arm (deltoid). Group 1 received 200 μg C-TAB.G5.1 alone in 0.5 ml 1×PBS by IM injection and Group 2 received 200 μg C-TAB.G5.1 with 250 μg alum in 0.5 ml 1×PBS by IM injection.

At the established time points (study days 0, 14, 28 and 42), 2-3 mL of whole blood was obtained by standard methods into serum separator tubes. Serum samples were frozen at approximately −20° C. ELISA method was then used to assess anti-C-TAB, anti-toxin A and anti-toxin B IgG titers. Antibody titers were presented in ELISA Units (EU).

FIG. 15 shows that increased doses of C-TAB.G5.1 lead to increased antibody production recognizing all three proteins, while the presence of alum significantly improved antibody levels. The highest antibody titers were observed with two vaccinations on day 42. These data clearly indicate feasibility of using the recombinant C-TAB.G5 or C-TAB.G5.1 fusion proteins for vaccination subjects in need thereof.

Example 9: Comparison of the Immunogenicity of C-TAB.G5 and C-TAB.G5.1

This study was to compare the immunogenicity of C-TAB.G5 and C-TAB.G5.1 as well as the effect of two different buffers in which the C-TAB was delivered in. C57BL/6 female mice (Charles River Labs.), aged between 8 and 9 weeks, were utilized for immunization. All animals received the first immunization by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second immunization was done by IM injection into the left thigh muscle on day 14. A total of 72 mice were divided into 12 groups vaccinated as follows:

Group 1: 1 μg C-TAB.G5 in PBS

Group 2: 3 μg C-TAB.G5 in PBS

Group 3: 10 μg C-TAB.G5 in PBS

Group 4: 30 μg C-TAB.G5 in PBS

Group 5: 1 μg C-TAB.G5 in histidine buffer

Group 6: 3 μg C-TAB.G5 in histidine buffer

Group 7: 10 μg C-TAB.G5 in histidine buffer

Group 8: 30 μg C-TAB.G5 in histidine buffer

Group 9: 1 μg C-TAB.G5.1 in histidine buffer

Group 10: 3 μg C-TAB.G5.1 in histidine buffer

Group 11: 10 μg C-TAB.G5.1 in histidine buffer

Group 12: 30 μg C-TAB.G5.1 in histidine buffer

Blood samples were collected from all animals two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were determined by ELISA and reported as ELISA Units.

FIG. 16 shows that all antibody titers (anti-C-TAB, anti-toxin A and anti-toxin B) were not significantly different (as revealed by T-test analysis) over 1-30 μg dose range for three vaccine formulations. Slightly higher antibody production was achieved with C-TAB.G5 formulation in histidine buffer, as compare to PBS. No significant difference was observed between immunization with C-TAB.G5 and C-TAB.G5.1 histidine formulations. Thus, this study demonstrates the equal immunogenicity of C-TAB.G5 and C-TAB.G5.1 constructs.

Example 10: Preparation and Evaluation of the Alternative C-TABNCTB and C-TADCTB Fusion Proteins

This Example describes the preparation of two other fusion proteins comprising one portion of the C-terminal domain of CTA and two portions of the C-terminal domain of CTB derived from C. difficile VPI-10463 strain. The C-TABNCTB fusion protein (SEQ ID NO: 18) comprises, like C-TAB.G5, 19 repeating units of CTA (amino acids 2272-2710), 23 repeating units of CTB (amino acids 1850-2366), plus additional 10 repeats of CTB (amino acids 1834-2057) fused to the C-terminus of CTB. The C-TADCTB fusion protein (SEQ ID NO: 20) comprises C-TAB.G5 sequence (19 repeats of CTA and 23 repeats of CTB) plus additional 24 repeating units of CTB (amino acids 1834-2366) fused to the C-terminus of C-TAB.G5. Thus, C-TADCTB comprises a double portion of repeating units of CTB. Cloning of the C-TABNCTB and C-TADCTB gene constructs was done in a way similar to that described in Example 1. 1 The recombinant fusion proteins were expressed in E. coli cells and purified using standard procedure as described in Example 1.2. The isolated polypeptides were evaluated in the immunogenicity and protection studies in animals.

Example 10.1: Comparison of the Immunogenicity and Protective Efficacy of C-TAB.G5, C-TABNCTB and C-TADCTB in Mice

This study was to compare the immunogenicity and protective efficacy of C-TAB.G5, C-TABNCTB and C-TADCTB in mice vaccinated with five antigen doses over a two log range. Female C57BL/6 mice (Charles River Labs.), aged 6-7 weeks, were utilized for this study. All animals received two vaccinations: the first one by intramuscular (IM) injection (50 μl) into the right thigh muscle on day 0. The second vaccination was done by IM injection into the left thigh muscle on day 14. All immunizations were done in the absence of alum. Blood samples were collected two weeks after the second immunization (study day 28). The serum was stored at −20° C. until analyzed. Serum antibody titers to C-TAB, toxin A and toxin B were determined by ELISA and reported as ELISA Units (EU) shown in FIG. 17.

This study demonstrated that the alternative fusion proteins C-TADCTB and C-TABNCTB, as well as C-TAB.G5, are highly immunogenic and able to induce strong antibody response against both toxin A and toxin B even without adding an adjuvant.

In addition to assessing antibody response, the ability of C-TADCTB and C-TABNCTB immunization to protect mice from a lethal challenge of native toxin B was determined. Three weeks after the second vaccination (study day 35) animals in vaccinated and non-vaccinated groups (N=6) received intraperitoneally (IP) a lethal dose of 50 ng of toxin B. Survival of the mice was monitored over the following 9 days and the results are shown in FIG. 18. This experiment demonstrated that immunization of mice with 33 μg of C-TADCTB in the absence of alum was capable of conferring 100% protection against a lethal challenge with native toxin B, while the same dose of C-TAB.G5 and C-TABNCTB induces only partial protection. This data indicates that, similarly to C-TAB.G5, two other fusion proteins C-TADCTB and C-TABNCTB may be protective against the lethal challenge with the native toxin.

Example 10.2: Comparison of the Immunogenicity and Protective Efficacy of C-TAB.G5.1 and C-TADCTB in Hamsters

This study was to further evaluate the immunogenicity of the alternative fusion protein C-TADCTB administered with or without alum adjuvant in a different animal model.

The study was designed as described in Example 6: female hamsters were vaccinated three times by IM injection (study day 0, 14 and 28) in the presence or absence of 100 μg alum hydroxide. Two weeks after the third vaccination (study day 42) all animals received a lethal challenge by intraperitoneal (IP) injection with 75 ng toxin A or 125 ng toxin B. Blood samples were collected on study day 14, 28 and 35 and serum antibody titers to C-TAB, toxin A and toxin B were determined by ELISA. Toxin A and toxin B neutralizing antibodies (TNA) were measured in day 35 sera. Survival of the hamsters was monitored and reported as % of protection.

This study demonstrated that the fusion protein C-TADCTB can induce anti-toxin antibody response in hamsters, similarly to mice. The alum adjuvant significantly improved all tested antibody titers. The results of the TNA assay shown in FIG. 19 indicate that antibody generated against C-TADCTB are effective in neutralizing toxic effects of C. difficile toxin A and toxin B. FIG. 19 also demonstrates comparison of protection data for hamsters immunized either with C-TAB.G5.1 or with C-TADCTB. High protection was achieved by vaccination with both recombinant fusion proteins.

Example 11: An Open-Label Phase 1 Study Assessing the Safety, Immunogenicity and Dose Response of a Pharmaceutical Composition Comprising C-TAB.G5.1

The pharmaceutical composition comprising C-TAB.G5.1, a recombinant fusion protein consisting of truncated Clostridium difficile (C. difficile) Toxin A and Toxin B, which will be administered at three different doses: 20 μg with Al(OH)₃ (alum), 75 and 200 μg without or with Al(OH)₃, respectively, intramuscular (IM) injection, three vaccinations on Day 0, 7 and 21.

Study Objectives

Primary:

-   -   To investigate the safety and tolerability of a pharmaceutical         composition comprising C-TAB.G5.1 up to 6 months after the third         vaccination.         Secondary:     -   To investigate the immune response measured against the vaccine         antigen C-TAB.G5.1 and the native Toxins A and B of C. difficile         to three different doses and two formulations on Days 0, 7, 14,         21, 28, 113, 201 after the first vaccination to obtain a first         indication of the optimal dose and formulation.     -   To investigate the capacity of C-TAB.G5.1 vaccine-induced IgG         antibodies to neutralize C. difficile Toxins A and B in vitro.         Study Design

This is an open-label, partially randomized, dose escalation Phase 1 study which will consist of a part A in healthy adults aged between ≥18 and <65 years and a part B in healthy elderly ≥65 years, the latter age group being the most vulnerable population to suffer from C. difficile infections. Part A will be conducted with vaccination schedule Day 0, 7 and 21 in five treatment groups of 12 healthy adult subjects to study safety and dose response to 20 μg C-TAB.G5.1 vaccine with adjuvant, and to 75 μg and 200 μg of C-TAB.G5.1 vaccine with or without adjuvant, respectively. Safety and immunogenicity will be analyzed after all adult subjects of part A have received the third vaccination, all safety data will be reviewed by a Data Safety Monitoring Board (DSMB) prior to enrollment of subjects from part B. In case non-safe or futile treatment groups (i.e., doses that do not induce considerable IgG responses) are identified during the interim analysis, these treatment groups will be dropped and not carried forward to part B.

Part B of the study will seek dose confirmation in the elderly population. Accordingly, Part B will be conducted in 5 treatment groups of 20 elderly healthy subjects per group. Vaccination schedule Day 0, 7 and 21 will be applied. This study design will allow to compare dose responses in both adults and elderly. The latter age group will be the major target population for a C. difficile vaccine, representing the most vulnerable population for the two target indications in the development pathway of a C. difficile vaccine, i.e. prevention of recurrent C. difficile diarrhea and prevention of primary C. difficile infection in an age-based or age-risk based preventive vaccination approach. However, elderly subjects might be less responsive to vaccination than young adults; thus, dose confirmation in the elderly target population from an early development stage on is required. An interim analysis after all adults from part A have been vaccinated will allow to drop non-safe or doses/formulations which do not induce considerable IgG responses in adults in order to mitigate the risk of exposing subjects in the elderly group to potentially unsafe or futile doses (e.g. lowest dose) and/or formulations (e.g. non-adjuvanted formulation) of the vaccine. The C-TAB.G5.1 vaccine is an aqueous solution of C-TAB.G5.1 in 20 mM L-Histidine, 75 mM NaCl, 5% Sucrose, 0.025% Tween®80; pH6.5 produced by standard methods.

SEQUENCES: SEQ ID Name NOs: Sequences C-TAB.G5 1 ATGGTAACAGGAGTATTTAAAGGACCTAATGGATTTGAGTATTTTGC (nucleic ACCTGCTAATACTCACAATAATAACATAGAAGGTCAGGCTATAGTTT acid ACCAGAACAAATTCTTAACTTTGAATGGCAAAAAATATTATTTTGAT sequence) AATGACTCAAAAGCAGTTACTGGATGGCAAACCATTGATGGTAAAA AATATTACTTTAATCTTAACACTGCTGAAGCAGCTACTGGATGGCAA ACTATTGATGGTAAAAAATATTACTTTAATCTTAACACTGCTGAAGC AGCTACTGGATGGCAAACTATTGATGGTAAAAAATATTACTTTAATA CTAACACTTTCATAGCCTCAACTGGTTATACAAGTATTAATGGTAAA CATTTTTATTTTAATACTGATGGTATTATGCAGATAGGAGTGTTTAAA GGACCTAATGGATTTGAATACTTTGCACCTGCTAATACTCATAATAA CAACATAGAAGGTCAAGCTATACTTTACCAAAATAAATTCTTAACTT TGAATGGTAAAAAATATTACTTTGGTAGTGACTCAAAAGCAGTTACC GGATTGCGAACTATTGATGGTAAAAAATATTACTTTAATACTAACAC TGCTGTTGCAGTTACTGGATGGCAAACTATTAATGGTAAAAAATACT ACTTTAATACTAACACTTCTATAGCTTCAACTGGTTATACAATTATTA GTGGTAAACATTTTTATTTTAATACTGATGGTATTATGCAGATAGGAG TGTTTAAAGGACCTGATGGATTTGAATACTTTGCACCTGCTAATACA GATGCTAACAATATAGAAGGTCAAGCTATACGTTATCAAAATAGATT CCTATATTTACATGACAATATATATTATTTTGGTAATAATTCAAAAGC AGCTACTGGTTGGGTAACTATTGATGGTAATAGATATTACTTCGAGC CTAATACAGCTATGGGTGCGAATGGTTATAAAACTATTGATAATAAA AATTTTTACTTTAGAAATGGTTTACCTCAGATAGGAGTGTTTAAAGG GTCTAATGGATTTGAATACTTTGCACCTGCTAATACGGATGCTAACA ATATAGAAGGTCAAGCTATACGTTATCAAAATAGATTCCTACATTTA CTTGGAAAAATATATTACTTTGGTAATAATTCAAAAGCAGTTACTGG ATGGCAAACTATTAATGGTAAAGTATATTACTTTATGCCTGATACTG CTATGGCTGCAGCTGGTGGACTTTTCGAGATTGATGGTGTTATATATT TCTTTGGTGTTGATGGAGTAAAAGCCCCTGGGATATATGGCAGATCT ATGCATAATTTGATAACTGGATTTGTGACTGTAGGCGATGATAAATA CTACTTTAATCCAATTAATGGTGGAGCTGCTTCAATTGGAGAGACAA TAATTGATGACAAAAATTATTATTTCAACCAAAGTGGAGTGTTACAA ACAGGTGTATTTAGTACAGAAGATGGATTTAAATATTTTGCCCCAGC TAATACACTTGATGAAAACCTAGAAGGAGAAGCAATTGATTTTACTG GAAAATTAATTATTGACGAAAATATTTATTATTTTGATGATAATTATA GAGGAGCTGTAGAATGGAAAGAATTAGATGGTGAAATGCACTATTTT AGCCCAGAAACAGGTAAAGCTTTTAAAGGTCTAAATCAAATAGGTG ATTATAAATACTATTTCAATTCTGATGGAGTTATGCAAAAAGGATTT GTTAGTATAAATGATAATAAACACTATTTTGATGATTCTGGTGTTATG AAAGTAGGTTACACTGAAATAGATGGCAAGCATTTCTACTTTGCTGA AAACGGAGAAATGCAAATAGGAGTATTTAATACAGAAGATGGATTT AAATATTTTGCTCATCATAATGAAGATTTAGGAAATGAAGAAGGTGA AGAAATCTCATATTCTGGTATATTAAATTTCAATAATAAAATTTACTA TTTTGATGATTCATTTACAGCTGTAGTTGGATGGAAAGATTTAGAGG ATGGTTCAAAGTATTATTTTGATGAAGATACAGCAGAAGCATATATA GGTTTGTCATTAATAAATGATGGTCAATATTATTTTAATGATGATGGA ATTATGCAAGTTGGATTTGTCACTATAAATGATAAAGTCTTCTACTTC TCTGACTCTGGAATTATAGAATCTGGAGTACAAAACATAGATGACAA TTATTTCTATATAGATGATAATGGTATAGTTCAAATTGGTGTATTTGA TACTTCAGATGGATATAAATATTTTGCACCTGCTAATACTGTAAATG ATAATATTTACGGACAAGCAGTTGAATATAGTGGTTTAGTTAGAGTT GGGGAAGATGTATATTATTTTGGAGAAACATATACAATTGAGACTGG ATGGATATATGATATGGAAAATGAAAGTGATAAATATTATTTCAATC CAGAAACTAAAAAAGCATGCAAAGGTATTAATTTAATTGATGATATA AAATATTATTTTGATGAGAAGGGCATAATGAGAACGGGTCTTATATC ATTTGAAAATAATAATTATTACTTTAATGAGAATGGTGAAATGCAAT TTGGTTATATAAATATAGAAGATAAGATGTTCTATTTTGGTGAAGAT GGTGTCATGCAGATTGGAGTATTTAATACACCAGATGGATTTAAATA CTTTGCACATCAAAATACTTTGGATGAGAATTTTGAGGGAGAATCAA TAAACTATACTGGTTGGTTAGATTTAGATGAAAAGAGATATTATTTT ACAGATGAATATATTGCAGCAACTGGTTCAGTTATTATTGATGGTGA GGAGTATTATTTTGATCCTGATACAGCTCAATTAGTGATTAGTGAATA G C-TAB.G5 2 MVTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDN (amino DSKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEAA acid TGWQTIDGKKYYFNTNTFIASTGYTSINGKHFYFNTDGIMQIGVFKGPN sequence) GFEYFAPANTHNNNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRTI DGKKYYFNTNTAVAVTGWQTINGKKYYFNTNTSIASTGYTIISGKHFYF NTDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHDNIY YFGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLP QIGVFKGSNGFEYFAPANTDANNIEGQAIRYQNRFLHLLGKIYYFGNNS KAVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGI YGRSMHNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGV LQTGVFSTEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYR GAVEWKELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFV SINDNKHYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKY FAHHNEDLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWKDLEDGSK YYFDEDTAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGII ESGVQNIDDNYFYIDDNGIVQIGVFDTSDGYKYFAPANTVNDNIYGQAV EYSGLVRVGEDVYYFGETYTIETGWIYDMENESDKYYFNPETKKACKGI NLIDDIKYYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYF GEDGVMQIGVFNTPDGFKYFAHQNTLDENFEGESINYTGWLDLDEKRY YFTDEYIAATGSVIIDGEEYYFDPDTAQLVISE C-TAB.G5.1 3 CCATGGTTACAGGTGTTTTCAAAGGTCCGAACGGCTTTGAATATTTTG (nucleic CACCGGCAAATACCCACAATAATAATATTGAAGGCCAGGCCATCGTG acid TATCAGAATAAATTTCTGACCCTGAACGGCAAAAAATACTATTTCGA sequence) TAACGATAGCAAAGCAGTTACCGGTTGGCAAACCATTGATGGCAAA AAATATTACTTCAACCTGAATACCGCAGAAGCAGCAACCGGCTGGCA GACGATCGACGGTAAAAAGTACTATTTTAACCTGAACACAGCCGAA GCCGCTACAGGCTGGCAGACAATAGATGGGAAGAAGTATTATTTTAA TACCAATACCTTTATTGCCAGCACCGGCTATACCAGCATTAATGGCA AACACTTCTATTTTAACACCGATGGTATTATGCAGATCGGTGTGTTTA AGGGCCCTAATGGTTTTGAGTACTTCGCTCCGGCTAATACCGATGCA AATAACATCGAAGGTCAGGCAATTCTGTACCAGAACAAATTTTTAAC GCTGAACGGTAAGAAATATTACTTTGGTAGCGATTCAAAAGCCGTTA CCGGTCTGCGTACGATCGACGGCAAGAAATATTATTTCAATACAAAC ACCGCAGTTGCCGTGACAGGTTGGCAGACGATAAATGGTAAGAAGT ACTACTTCAACACCAATACCAGCATTGCAAGTACCGGTTATACCATT ATCAGCGGCAAACACTTTTACTTCAATACAGACGGCATTATGCAGAT TGGCGTTTTCAAAGGTCCGGATGGTTTCGAGTACTTTGCCCCTGCAA ATACAGATGCAAACAATATTGAGGGACAGGCAATTCGCTATCAGAA TCGTTTTCTGTATCTGCACGATAACATCTATTACTTCGGCAATAATTC AAAAGCAGCCACCGGTTGGGTTACAATTGATGGTAATCGTTATTACT TTGAGCCGAATACCGCAATGGGTGCAAATGGTTATAAAACCATCGAT AACAAAAATTTTTATTTCCGCAACGGTCTGCCGCAGATTGGTGTTTTT AAGGGTAGCAATGGCTTCGAGTATTTTGCGCCAGCCAACACCGATGC CAACAACATTGAAGGCCAAGCGATTCGTTATCAAAACCGCTTTCTGC ATCTGCTGGGCAAAATTTATTACTTTGGCAACAATAGCAAAGCGGTG ACGGGCTGGCAAACCATTAACGGTAAAGTTTATTATTTCATGCCGGA TACCGCTATGGCAGCAGCCGGTGGTCTGTTTGAAATTGATGGCGTGA TTTATTTTTTTGGCGTGGATGGTGTTAAAGCACCGGGTATTTATGGTC GTAGCATGCATAATCTGATTACCGGTTTTGTTACCGTGGGCGACGAT AAATACTACTTTAATCCGATTAATGGTGGTGCAGCAAGCATTGGTGA AACCATTATCGATGACAAAAACTATTATTTTAACCAGAGCGGTGTTC TGCAGACAGGTGTTTTTAGCACCGAAGATGGCTTCAAATATTTTGCT CCTGCGAATACACTGGATGAAAATCTGGAAGGTGAAGCAATTGATTT TACCGGCAAACTGATCATCGACGAGAACATCTACTATTTTGATGATA ATTATCGCGGTGCCGTGGAATGGAAAGAACTGGATGGTGAAATGCA CTATTTTAGTCCGGAAACCGGTAAAGCCTTTAAAGGTCTGAATCAGA TCGGCGATTACAAGTATTACTTTAATTCAGATGGCGTGATGCAGAAA GGCTTTGTGAGCATTAACGACAACAAACACTATTTTGACGACAGCGG TGTGATGAAAGTGGGTTATACCGAAATCGACGGGAAACATTTTTATT TTGCCGAAAACGGCGAAATGCAGATTGGAGTATTTAATACCGAGGA CGGCTTTAAATACTTTGCCCATCATAATGAAGATCTGGGTAATGAAG AAGGCGAAGAAATTAGCTATAGCGGCATTCTGAATTTTAATAACAAG ATCTATTATTTCGATGATAGCTTCACCGCAGTTGTTGGTTGGAAAGAT CTGGAAGATGGCAGCAAATATTATTTTGATGAAGATACCGCAGAGGC CTATATTGGTCTGAGCCTGATTAATGATGGCCAGTATTATTTCAACGA TGATGGTATCATGCAGGTTGGTTTTGTGACCATCAACGATAAAGTGT TCTATTTCAGCGATAGCGGCATTATTGAAAGCGGTGTTCAGAACATC GACGATAACTATTTCTACATCGATGATAACGGTATTGTTCAGATTGG CGTGTTTGATACCTCCGATGGTTATAAATATTTCGCACCAGCCAATAC CGTGAACGATAATATTTATGGTCAGGCAGTTGAATATTCAGGTCTGG TTCGTGTTGGCGAAGATGTTTATTATTTTGGCGAAACCTATACCATTG AAACCGGCTGGATCTATGATATGGAAAACGAGAGCGACAAGTACTA TTTCAATCCGGAAACGAAAAAAGCCTGCAAAGGCATTAATCTGATCG ACGATATTAAGTACTACTTTGACGAAAAAGGCATTATGCGTACCGGT CTGATTAGCTTTGAGAACAACAACTATTACTTCAATGAGAACGGTGA GATGCAGTTTGGCTATATCAACATCGAGGACAAAATGTTTTATTTTG GTGAGGACGGTGTGATGCAGATAGGGGTTTTTAATACACCGGATGGG TTTAAGTATTTTGCACATCAGAACACCCTGGATGAAAACTTTGAAGG CGAAAGCATTAATTATACCGGTTGGCTGGATCTGGATGAGAAACGTT ATTATTTCACCGACGAATACATTGCAGCAACCGGTAGCGTTATTATT GATGGTGAGGAATATTACTTCGATCCGGATACAGCACAGCTGGTTAT TAGCGAATAACTCGAG C-TAB.G5.1 4 VTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDND (amino SKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEAAT acid GWQTIDGKKYYFNTNTFIASTGYTSINGKHFYFNTDGIMQIGVFKGPNG sequence) FEYFAPANTDANNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRTID GKKYYFNTNTAVAVTGWQTINGKKYYFNTNTSIASTGYTIISGKHFYFN TDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHDNIYY FGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLPQ IGVFKGSNGFEYFAPANTDANNIEGQAIRYQNRFLHLLGKIYYFGNNSK AVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGIY GRSMHNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGVL QTGVFSTEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYRG AVEWKELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFVSI NDNKHYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKYF AHHNEDLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWKDLEDGSKY YFDEDTAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGIIE SGVQNIDDNYFYIDDNGIVQIGVFDTSDGYKYFAPANTVNDNIYGQAVE YSGLVRVGEDVYYFGETYTIETGWIYDMENESDKYYFNPETKKACKGI NLIDDIKYYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYF GEDGVMQIGVFNTPDGFKYFAHQNTLDENFEGESINYTGWLDLDEKRY YFTDEYIAATGSVIIDGEEYYFDPDTAQLVISE Nucleic 5 atgtctttaatatctaaagaagagttaataaaactcgcatatagcattagaccaagagaaaatgagtataaaactatac acid taactaatttagacgaatataataagttaactacaaacaataatgaaaataaatatttacaattaaaaaaactaaatgaa sequence tcaattgatgtttttatgaataaatataaaacttcaagcagaaatagagcactctctaatctaaaaaaagatatattaaaa of trdA gaagtaattcttattaaaaattccaatacaagccctgtagaaaaaaatttacattttgtatggataggtggagaagtcag (strain tgatattgctcttgaatacataaaacaatgggctgatattaatgcagaatataatattaaactgtggtatgatagtgaag 630) cattcttagtaaatacactaaaaaaggctatagttgaatcttctaccactgaagcattacagctactagaggaagagat tcaaaatcctcaatttgataatatgaaattttacaaaaaaaggatggaatttatatatgatagacaaaaaaggtttataaa ttattataaatctcaaatcaataaacctacagtacctacaatagatgatattataaagtctcatctagtatctgaatataat agagatgaaactgtattagaatcatatagaacaaattctttgagaaaaataaatagtaatcatgggatagatatcagg gctaatagtttgtttacagaacaagagttattaaatatttatagtcaggagttgttaaatcgtggaaatttagctgcagca tctgacatagtaagattattagccctaaaaaattttggcggagtatatttagatgttgatatgcttccaggtattcactctg atttatttaaaacaatatctagacctagctctattggactagaccgttgggaaatgataaaattagaggctattatgaag tataaaaaatatataaataattatacatcagaaaactttgataaacttgatcaacaattaaaagataattttaaactcatta tagaaagtaaaagtgaaaaatctgagatattttctaaattagaaaatttaaatgtatctgatcttgaaattaaaatagcttt cgctttaggcagtgttataaatcaagccttgatatcaaaacaaggttcatatcttactaacctagtaatagaacaagtaa aaaatagatatcaatttttaaaccaacaccttaacccagccatagagtctgataataacttcacagatactactaaaatt tttcatgattcattatttaattcagctaccgcagaaaactctatgtttttaacaaaaatagcaccatacttacaagtaggttt tatgccagaagctcgctccacaataagtttaagtggtccaggagcttatgcgtcagcttactatgatttcataaatttac aagaaaatactatagaaaaaactttaaaagcatcagatttaatagaatttaaattcccagaaaataatctatctcaattg acagaacaagaaataaatagtctatggagctttgatcaagcaagtgcaaaatatcaatttgagaaatatgtaagagat tatactggtggatctctttctgaagacaatggggtagactttaataaaaatactgccctcgacaaaaactatttattaaat aataaaattccatcaaacaatgtagaagaagctggaagtaaaaattatgttcattatatcatacagttacaaggagatg atataagttatgaagcaacatgcaatttattttctaaaaatcctaaaaatagtattattatacaacgaaatatgaatgaaa gtgcaaaaagctactttttaagtgatgatggagaatctattttagaattaaataaatataggatacctgaaagattaaaa aataaggaaaaagtaaaagtaacctttattggacatggtaaagatgaattcaacacaagcgaatttgctagattaagt gtagattcactttccaatgagataagttcatttttagataccataaaattagatatatcacctaaaaatgtagaagtaaac ttacttggatgtaatatgtttagttatgattttaatgttgaagaaacttatcctgggaagttgctattaagtattatggacaa aattacttccactttacctgatgtaaataaaaattctattactataggagcaaatcaatatgaagtaagaattaatagtga gggaagaaaagaacttctggctcactcaggtaaatggataaataaagaagaagctattatgagcgatttatctagta aagaatacattttttttgattctatagataataagctaaaagcaaagtccaagaatattccaggattagcatcaatatca gaagatataaaaacattattacttgatgcaagtgttagtcctgatacaaaatttattttaaataatcttaagcttaatattga atcttctattggtgattacatttattatgaaaaattagagcctgttaaaaatataattcacaattctatagatgatttaataga tgagttcaatctacttgaaaatgtatctgatgaattatatgaattaaaaaaattaaataatctagatgagaagtatttaata tcttttgaagatatctcaaaaaataattcaacttactctgtaagatttattaacaaaagtaatggtgagtcagtttatgtag aaacagaaaaagaaattttttcaaaatatagcgaacatattacaaaagaaataagtactataaagaatagtataattac agatgttaatggtaatttattggataatatacagttagatcatacttctcaagttaatacattaaacgcagcattctttattc aatcattaatagattatagtagcaataaagatgtactgaatgatttaagtacctcagttaaggttcaactttatgctcaac tatttagtacaggtttaaatactatatatgactctatccaattagtaaatttaatatcaaatgcagtaaatgatactataaat gtactacctacaataacagaggggatacctattgtatctactatattagacggaataaacttaggtgcagcaattaag gaattactagacgaacatgacccattactaaaaaaagaattagaagctaaggtgggtgttttagcaataaatatgtca ttatctatagctgcaactgtagcttcaattgttggaataggtgctgaagttactattttcttattacctatagctggtatatct gcaggaataccttcattagttaataatgaattaatattgcatgataaggcaacttcagtggtaaactattttaatcatttgt ctgaatctaaaaaatatggccctcttaaaacagaagatgataaaattttagttcctattgatgatttagtaatatcagaaa tagattttaataataattcgataaaactaggaacatgtaatatattagcaatggaggggggatcaggacacacagtga ctggtaatatagatcactttttctcatctccatctataagttctcatattccttcattatcaatttattctgcaataggtataga aacagaaaatctagatttttcaaaaaaaataatgatgttacctaatgctccttcaagagtgttttggtgggaaactgga gcagttccaggtttaagatcattggaaaatgacggaactagattacttgattcaataagagatttatacccaggtaaat tttactggagattctatgcttttttcgattatgcaataactacattaaaaccagtttatgaagacactaatattaaaattaaa ctagataaagatactagaaacttcataatgccaactataactactaacgaaattagaaacaaattatcttattcatttgat ggagcaggaggaacttactctttattattatcttcatatccaatatcaacgaatataaatttatctaaagatgatttatgga tatttaatattgataatgaagtaagagaaatatctatagaaaatggtactattaaaaaaggaaagttaataaaagatgttt taagtaaaattgatataaataaaaataaacttattataggcaatcaaacaatagatttttcaggcgatatagataataaa gatagatatatattcttgacttgtgagttagatgataaaattagtttaataatagaaataaatcttgttgcaaaatcttatag tttgttattgtctggggataaaaattatttgatatccaatttatctaatattattgagaaaatcaatactttaggcctagatag taaaaatatagcgtacaattacactgatgaatctaataataaatattttggagctatatctaaaacaagtcaaaaaagca taatacattataaaaaagacagtaaaaatatattagaattttataatgacagtacattagaatttaacagtaaagattttat tgctgaagatataaatgtatttatgaaagatgatattaatactataacaggaaaatactatgttgataataatactgataa aagtatagatttctctatttctttagttagtaaaaatcaagtaaaagtaaatggattatatttaaatgaatccgtatactcat cttaccttgattttgtgaaaaattcagatggacaccataatacttctaattttatgaatttatttttggacaatataagtttctg gaaattgtttgggtttgaaaatataaattttgtaatcgataaatactttacccttgttggtaaaactaatcttggatatgtag aatttatttgtgacaataataaaaatatagatatatattttggtgaatggaaaacatcgtcatctaaaagcactatatttag cggaaatggtagaaatgttgtagtagagcctatatataatcctgatacgggtgaagatatatctacttcactagattttt cctatgaacctctctatggaatagatagatatatcaataaagtattgatagcacctgatttatatacaagtttaataaatat taataccaattattattcaaatgagtactaccctgagattatagttcttaacccaaatacattccacaaaaaagtaaatat aaatttagatagttcttcttttgagtataaatggtctacagaaggaagtgactttattttagttagatacttagaagaaagt aataaaaaaatattacaaaaaataagaatcaaaggtatcttatctaatactcaatcatttaataaaatgagtatagatttt aaagatattaaaaaactatcattaggatatataatgagtaattttaaatcatttaattctgaaaatgaattagatagagatc atttaggatttaaaataatagataataaaacttattactatgatgaagatagtaaattagttaaaggattaatcaatataaa taattcattattctattttgatcctatagaatttaacttagtaactggatggcaaactatcaatggtaaaaaatattattttga tataaatactggagcagctttaattagttataaaattattaatggtaaacacttttattttaataatgatggtgtgatgcagt tgggagtatttaaaggacctgatggatttgaatattttgcacctgccaatactcaaaataataacatagaaggtcaggc tatagtttatcaaagtaaattcttaactttgaatggcaaaaaatattattttgataatgactcaaaagcagtcactggatg gagaattattaacaatgagaaatattactttaatcctaataatgctattgctgcagtcggattgcaagtaattgacaataa taagtattatttcaatcctgacactgctatcatctcaaaaggttggcagactgttaatggtagtagatactactttgatact gataccgctattgcctttaatggttataaaactattgatggtaaacacttttattttgatagtgattgtgtagtgaaaatag gtgtgtttagtacctctaatggatttgaatattttgcacctgctaatacttataataataacatagaaggtcaggctatagt ttatcaaagtaaattcttaactttgaatggtaaaaaatattactttgataataactcaaaagcagttaccggatggcaaa ctattgatagtaaaaaatattactttaatactaacactgctgaagcagctactggatggcaaactattgatggtaaaaa atattactttaatactaacactgctgaagcagctactggatggcaaactattgatggtaaaaaatattactttaatactaa cactgctatagcttcaactggttatacaattattaatggtaaacatttttattttaatactgatggtattatgcagataggag tgtttaaaggacctaatggatttgaatattttgcacctgctaatacggatgctaacaacatagaaggtcaagctatactt taccaaaatgaattcttaactttgaatggtaaaaaatattactttggtagtgactcaaaagcagttactggatggagaat tattaacaataagaaatattactttaatcctaataatgctattgctgcaattcatctatgcactataaataatgacaagtatt actttagttatgatggaattcttcaaaatggatatattactattgaaagaaataatttctattttgatgctaataatgaatcta aaatggtaacaggagtatttaaaggacctaatggatttgagtattttgcacctgctaatactcacaataataacataga aggtcaggctatagtttaccagaacaaattcttaactttgaatggcaaaaaatattattttgataatgactcaaaagcag ttactggatggcaaaccattgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaac tattgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaactattgatggtaaaaaat attactttaatactaacactttcatagcctcaactggttatacaagtattaatggtaaacatttttattttaatactgatggta ttatgcagataggagtgtttaaaggacctaatggatttgaatactttgcacctgctaatactcataataataacatagaa ggtcaagctatactttaccaaaataaattcttaactttgaatggtaaaaaatattactttggtagtgactcaaaagcagtt accggattgcgaactattgatggtaaaaaatattactttaatactaacactgctgttgcagttactggatggcaaactat taatggtaaaaaatactactttaatactaacacttctatagcttcaactggttatacaattattagtggtaaacatttttatttt aatactgatggtattatgcagataggagtgtttaaaggacctgatggatttgaatactttgcacctgctaatacagatg ctaacaatatagaaggtcaagctatacgttatcaaaatagattcctatatttacatgacaatatatattattttggtaataat tcaaaagcagctactggttgggtaactattgatggtaatagatattacttcgagcctaatacagctatgggtgcgaatg gttataaaactattgataataaaaatttttactttagaaatggtttacctcagataggagtgtttaaagggtctaatggattt gaatactttgcacctgctaatacggatgctaacaatatagaaggtcaagctatacgttatcaaaatagattcctacattt acttggaaaaatatattactttggtaataattcaaaagcagttactggatggcaaactattaatggtaaagtatattacttt atgcctgatactgctatggctgcagctggtggacttttcgagattgatggtgttatatatttctttggtgttgatggagta aaagcccctgggatatatggctaa Amino acid 6 MSLISKEELIKLAYSIRPRENEYKTILTNLDEYNKLTTNNNENKYLQLKK sequence LNESIDVFMNKYKTSSRNRALSNLKKDILKEVILIKNSNTSPVEKNLHFV of trdA WIGGEVSDIALEYIKQWADINAEYNIKLWYDSEAFLVNTLKKAIVESSTT (strain EALQLLEEEIQNPQFDNMKFYKKRMEFIYDRQKRFINYYKSQINKPTVPT 630) IDDIIKSHLVSEYNRDETVLESYRTNSLRKINSNHGIDIRANSLFTEQELLN IYSQELLNRGNLAAASDIVRLLALKNFGGVYLDVDMLPGIHSDLFKTISR PSSIGLDRWEMIKLEAIMKYKKYINNYTSENFDKLDQQLKDNFKLIIESK SEKSEIFSKLENLNVSDLEIKIAFALGSVINQALISKQGSYLTNLVIEQVKN RYQFLNQHLNPAIESDNNFTDTTKIFHDSLFNSATAENSMFLTKIAPYLQ VGFMPEARSTISLSGPGAYASAYYDFINLQENTIEKTLKASDLIEFKFPEN NLSQLTEQEINSLWSFDQASAKYQFEKYVRDYTGGSLSEDNGVDFNKN TALDKNYLLNNKIPSNNVEEAGSKNYVHYIIQLQGDDISYEATCNLFSK NPKNSIIIQRNMNESAKSYFLSDDGESILELNKYRIPERLKNKEKVKVTFI GHGKDEFNTSEFARLSVDSLSNEISSFLDTIKLDISPKNVEVNLLGCNMFS YDFNVEETYPGKLLLSIMDKITSTLPDVNKNSITIGANQYEVRINSEGRKE LLAHSGKWINKEEAIMSDLSSKEYIFFDSIDNKLKAKSKNIPGLASISEDI KTLLLDASVSPDTKFILNNLKLNIESSIGDYIYYEKLEPVKNIIHNSIDDLI DEFNLLENVSDELYELKKLNNLDEKYLISFEDISKNNSTYSVRFINKSNG ESVYVETEKEIFSKYSEHITKEISTIKNSIITDVNGNLLDNIQLDHTSQVNT LNAAFFIQSLIDYSSNKDVLNDLSTSVKVQLYAQLFSTGLNTIYDSIQLV NLISNAVNDTINVLPTITEGIPIVSTILDGINLGAAIKELLDEHDPLLKKELE AKVGVLAINMSLSIAATVASIVGIGAEVTIFLLPIAGISAGIPSLVNNELIL HDKATSVVNYFNHLSESKKYGPLKTEDDKILVPIDDLVISEIDFNNNSIKL GTCNILAMEGGSGHTVTGNIDHFFSSPSISSHIPSLSIYSAIGIETENLDFSK KIMMLPNAPSRVFWWETGAVPGLRSLENDGTRLLDSIRDLYPGKFYWR FYAFFDYAITTLKPVYEDTNIKIKLDKDTRNFIMPTITTNEIRNKLSYSFD GAGGTYSLLLSSYPISTNINLSKDDLWIFNIDNEVREISIENGTIKKGKLIK DVLSKIDINKNKLIIGNQTIDFSGDIDNKDRYIFLTCELDDKISLIIEINLVA KSYSLLLSGDKNYLISNLSNIIEKINTLGLDSKNIAYNYTDESNNKYFGAI SKTSQKSIIHYKKDSKNILEFYNDSTLEFNSKDFIAEDINVFMKDDINTITG KYYVDNNTDKSIDFSISLVSKNQVKVNGLYLNESVYSSYLDFVKNSDGH HNTSNFMNLFLDNISFWKLFGFENINFVIDKYFTLVGKTNLGYVEFICDN NKNIDIYFGEWKTSSSKSTIFSGNGRNVVVEPIYNPDTGEDISTSLDFSYE PLYGIDRYINKVLIAPDLYTSLININTNYYSNEYYPEIIVLNPNTFHKKVNI NLDSSSFEYKWSTEGSDFILVRYLEESNKKILQKIRIKGILSNTQSFNKMSI DFKDIKKLSLGYIMSNFKSFNSENELDRDHLGFKIIDNKTYYYDEDSKLV KGLININNSLFYFDPIEFNLVTGWQTINGKKYYFDINTGAALISYKIINGK HFYFNNDGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAIVYQSKFLTL NGKKYYFDNDSKAVTGWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYF NPDTAIISKGWQTVNGSRYYFDTDTAIAFNGYKTIDGKHFYFDSDCVVK IGVFSTSNGFEYFAPANTYNNNIEGQAIVYQSKFLTLNGKKYYFDNNSK AVTGWQTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAEAATGW QTIDGKKYYFNTNTAIASTGYTIINGKHFYFNTDGIMQIGVFKGPNGFEY FAPANTDANNIEGQAILYQNEFLTLNGKKYYFGSDSKAVTGWRIINNKK YYFNPNNAIAAIHLCTINNDKYYFSYDGILQNGYITIERNNFYFDANNES KMVTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFD NDSKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEA ATGWQTIDGKKYYFNTNTFIASTGYTSINGKHFYFNTDGIMQIGVFKGP NGFEYFAPANTHNNNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRT IDGKKYYFNTNTAVAVTGWQTINGKKYYFNTNTSIASTGYTIISGKHFYF NTDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHDNIY YFGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLP QIGVFKGSNGFEYFAPANTDANNIEGQAIRYQNRFLHLLGKIYYFGNNS KAVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGI YG Nucleic 7 atgagtttagttaatagaaaacagttagaaaaaatggcaaatgtaagatttcgtactcaagaagatgaatatgttgcaa acid tattggatgctttagaagaatatcataatatgtcagagaatactgtagtcgaaaaatatttaaaattaaaagatataaata sequence gtttaacagatatttatatagatacatataaaaaatctggtagaaataaagccttaaaaaaatttaaggaatatctagtta of trdB cagaagtattagagctaaagaataataatttaactccagttgagaaaaatttacattttgtttggattggaggtcaaata (strain aatgacactgctattaattatataaatcaatggaaagatgtaaatagtgattataatgttaatgttttttatgatagtaatgc 630) atttttgataaacacattgaaaaaaactgtagtagaatcagcaataaatgatacacttgaatcatttagagaaaacttaa atgaccctagatttgactataataaattcttcagaaaacgtatggaaataatttatgataaacagaaaaatttcataaact actataaagctcaaagagaagaaaatcctgaacttataattgatgatattgtaaagacatatctttcaaatgagtattca aaggagatagatgaacttaatacctatattgaagaatccttaaataaaattacacagaatagtggaaatgatgttagaa actttgaagaatttaaaaatggagagtcattcaacttatatgaacaagagttggtagaaaggtggaatttagctgctgc ttctgacatattaagaatatctgcattaaaagaaattggtggtatgtatttagatgttgatatgttaccaggaatacaacc agacttatttgagtctatagagaaacctagttcagtaacagtggatttttgggaaatgacaaagttagaagctataatg aaatacaaagaatatataccagaatatacctcagaacattttgacatgttagacgaagaagttcaaagtagttttgaat ctgttctagcttctaagtcagataaatcagaaatattctcatcacttggtgatatggaggcatcaccactagaagttaaa attgcatttaatagtaagggtattataaatcaagggctaatttctgtgaaagactcatattgtagcaatttaatagtaaaa caaatcgagaatagatataaaatattgaataatagtttaaatccagctattagcgaggataatgattttaatactacaac gaatacctttattgatagtataatggctgaagctaatgcagataatggtagatttatgatggaactaggaaagtatttaa gagttggtttcttcccagatgttaaaactactattaacttaagtggccctgaagcatatgcggcagcttatcaagatttat taatgtttaaagaaggcagtatgaatatccatttgatagaagctgatttaagaaactttgaaatctctaaaactaatattt ctcaatcaactgaacaagaaatggctagcttatggtcatttgacgatgcaagagctaaagctcaatttgaagaatata aaaggaattattttgaaggttctcttggtgaagatgataatcttgatttttctcaaaatatagtagttgacaaggagtatct tttagaaaaaatatcttcattagcaagaagttcagagagaggatatatacactatattgttcagttacaaggagataaa attagttatgaagcagcatgtaacttatttgcaaagactccttatgatagtgtactgtttcagaaaaatatagaagattca gaaattgcatattattataatcctggagatggtgaaatacaagaaatagacaagtataaaattccaagtataatttctga tagacctaagattaaattaacatttattggtcatggtaaagatgaatttaatactgatatatttgcaggttttgatgtagatt cattatccacagaaatagaagcagcaatagatttagctaaagaggatatttctcctaagtcaatagaaataaatttatta ggatgtaatatgtttagctactctatcaacgtagaggagacttatcctggaaaattattacttaaagttaaagataaaat atcagaattaatgccatctataagtcaagactctattatagtaagtgcaaatcaatatgaagttagaataaatagtgaa ggaagaagagaattattggatcattctggtgaatggataaataaagaagaaagtattataaaggatatttcatcaaaa gaatatatatcatttaatcctaaagaaaataaaattacagtaaaatctaaaaatttacctgagctatctacattattacaa gaaattagaaataattctaattcaagtgatattgaactagaagaaaaagtaatgttaacagaatgtgagataaatgttat ttcaaatatagatacgcaaattgttgaggaaaggattgaagaagctaagaatttaacttctgactctattaattatataaa agatgaatttaaactaatagaatctatttctgatgcactatgtgacttaaaacaacagaatgaattagaagattctcatttt atatcttttgaggacatatcagagactgatgagggatttagtataagatttattaataaagaaactggagaatctatattt gtagaaactgaaaaaacaatattctctgaatatgctaatcatataactgaagagatttctaagataaaaggtactatatt tgatactgtaaatggtaagttagtaaaaaaagtaaatttagatactacacacgaagtaaatactttaaatgctgcattttt tatacaatcattaatagaatataatagttctaaagaatctcttagtaatttaagtgtagcaatgaaagtccaagtttacgct caattatttagtactggtttaaatactattacagatgcagccaaagttgttgaattagtatcaactgcattagatgaaact atagacttacttcctacattatctgaaggattacctataattgcaactattatagatggtgtaagtttaggtgcagcaatc aaagagctaagtgaaacgagtgacccattattaagacaagaaatagaagctaagataggtataatggcagtaaattt aacaacagctacaactgcaatcattacttcatctttggggatagctagtggatttagtatacttttagttcctttagcagg aatttcagcaggtataccaagcttagtaaacaatgaacttgtacttcgagataaggcaacaaaggttgtagattatttta aacatgtttcattagttgaaactgaaggagtatttactttattagatgataaaataatgatgccacaagatgatttagtga tatcagaaatagattttaataataattcaatagttttaggtaaatgtgaaatctggagaatggaaggtggttcaggtcat actgtaactgatgatatagatcacttcttttcagcaccatcaataacatatagagagccacacttatctatatatgacgta ttggaagtacaaaaagaagaacttgatttgtcaaaagatttaatggtattacctaatgctccaaatagagtatttgcttg ggaaacaggatggacaccaggtttaagaagcttagaaaatgatggcacaaaactgttagaccgtataagagataa ctatgaaggtgagttttattggagatattttgcttttatagctgatgctttaataacaacattaaaaccaagatatgaagat actaatataagaataaatttagatagtaatactagaagttttatagttccaataataactacagaatatataagagaaaa attatcatattctttctatggttcaggaggaacttatgcattgtctctttctcaatataatatgggtataaatatagaattaag tgaaagtgatgtttggattatagatgttgataatgttgtgagagatgtaactatagaatctgataaaattaaaaaaggtg atttaatagaaggtattttatctacactaagtattgaagagaataaaattatcttaaatagccatgagattaatttttctggt gaggtaaatggaagtaatggatttgtttctttaacattttcaattttagaaggaataaatgcaattatagaagttgatttatt atctaaatcatataaattacttatttctggcgaattaaaaatattgatgttaaattcaaatcatattcaacagaaaatagatt atataggattcaatagcgaattacagaaaaatataccatatagctttgtagatagtgaaggaaaagagaatggttttat taatggttcaacaaaagaaggtttatttgtatctgaattacctgatgtagttcttataagtaaggtttatatggatgatagt aagccttcatttggatattatagtaataatttgaaagatgtcaaagttataactaaagataatgttaatatattaacaggtt attatcttaaggatgatataaaaatctctctttctttgactctacaagatgaaaaaactataaagttaaatagtgtgcattt agatgaaagtggagtagctgagattttgaagttcatgaatagaaaaggtaatacaaatacttcagattctttaatgagc tttttagaaagtatgaatataaaaagtattttcgttaatttcttacaatctaatattaagtttatattagatgctaattttataat aagtggtactacttctattggccaatttgagtttatttgtgatgaaaatgataatatacaaccatatttcattaagtttaata cactagaaactaattatactttatatgtaggaaatagacaaaatatgatagtggaaccaaattatgatttagatgattctg gagatatatcttcaactgttatcaatttctctcaaaagtatctttatggaatagacagttgtgttaataaagttgtaatttca ccaaatatttatacagatgaaataaatataacgcctgtatatgaaacaaataatacttatccagaagttattgtattagat gcaaattatataaatgaaaaaataaatgttaatatcaatgatctatctatacgatatgtatggagtaatgatggtaatgat tttattcttatgtcaactagtgaagaaaataaggtgtcacaagttaaaataagattcgttaatgtttttaaagataagactt tggcaaataagctatcttttaactttagtgataaacaagatgtacctgtaagtgaaataatcttatcatttacaccttcata ttatgaggatggattgattggctatgatttgggtctagtttctttatataatgagaaattttatattaataactttggaatgat ggtatctggattaatatatattaatgattcattatattattttaaaccaccagtaaataatttgataactggatttgtgactgt aggcgatgataaatactactttaatccaattaatggtggagctgcttcaattggagagacaataattgatgacaaaaat tattatttcaaccaaagtggagtgttacaaacaggtgtatttagtacagaagatggatttaaatattttgccccagctaat acacttgatgaaaacctagaaggagaagcaattgattttactggaaaattaattattgacgaaaatatttattattttgat gataattatagaggagctgtagaatggaaagaattagatggtgaaatgcactattttagcccagaaacaggtaaagc ttttaaaggtctaaatcaaataggtgattataaatactatttcaattctgatggagttatgcaaaaaggatttgttagtata aatgataataaacactattttgatgattctggtgttatgaaagtaggttacactgaaatagatggcaagcatttctacttt gctgaaaacggagaaatgcaaataggagtatttaatacagaagatggatttaaatattttgctcatcataatgaagatt taggaaatgaagaaggtgaagaaatctcatattctggtatattaaatttcaataataaaatttactattttgatgattcattt acagctgtagttggatggaaagatttagaggatggttcaaagtattattttgatgaagatacagcagaagcatatata ggtttgtcattaataaatgatggtcaatattattttaatgatgatggaattatgcaagttggatttgtcactataaatgataa agtcttctacttctctgactctggaattatagaatctggagtacaaaacatagatgacaattatttctatatagatgataat ggtatagttcaaattggtgtatttgatacttcagatggatataaatattttgcacctgctaatactgtaaatgataatattta cggacaagcagttgaatatagtggtttagttagagttggtgaagatgtatattattttggagaaacatatacaattgaga ctggatggatatatgatatggaaaatgaaagtgataaatattatttcaatccagaaactaaaaaagcatgcaaaggta ttaatttaattgatgatataaaatattattttgatgagaagggcataatgagaacgggtcttatatcatttgaaaataataa ttattactttaatgagaatggtgaaatgcaatttggttatataaatatagaagataagatgttctattttggtgaagatggt gtcatgcagattggagtatttaatacaccagatggatttaaatactttgcacatcaaaatactttggatgagaattttgag ggagaatcaataaactatactggttggttagatttagatgaaaagagatattattttacagatgaatatattgcagcaac tggttcagttattattgatggtgaggagtattattttgatcctgatacagctcaattagtgattagtgaatag Amino acid 8 MSLVNRKQLEKMANVRFRTQEDEYVAILDALEEYHNMSENTVVEKYL sequence KLKDINSLTDIYIDTYKKSGRNKALKKFKEYLVTEVLELKNNNLTPVEK of trdB NLHFVWIGGQINDTAINYINQWKDVNSDYNVNVFYDSNAFLINTLKKT (strain VVESAINDTLESFRENLNDPRFDYNKFFRKRMEIIYDKQKNFINYYKAQR 630) EENPELIIDDIVKTYLSNEYSKEIDELNTYIEESLNKITQNSGNDVRNFEEF KNGESFNLYEQELVERWNLAAASDILRISALKEIGGMYLDVDMLPGIQP DLFESIEKPSSVTVDFWEMTKLEAIMKYKEYIPEYTSEHFDMLDEEVQSS FESVLASKSDKSEIFSSLGDMEASPLEVKIAFNSKGIINQGLISVKDSYCSN LIVKQIENRYKILNNSLNPAISEDNDFNTTTNTFIDSIMAEANADNGRFM MELGKYLRVGFFPDVKTTINLSGPEAYAAAYQDLLMFKEGSMNIHLIEA DLRNFEISKTNISQSTEQEMASLWSFDDARAKAQFEEYKRNYFEGSLGE DDNLDFSQNIVVDKEYLLEKISSLARSSERGYIHYIVQLQGDKISYEAAC NLFAKTPYDSVLFQKNIEDSEIAYYYNPGDGEIQEIDKYKIPSIISDRPKIK LTFIGHGKDEFNTDIFAGFDVDSLSTEIEAAIDLAKEDISPKSIEINLLGCN MFSYSINVEETYPGKLLLKVKDKISELMPSISQDSIIVSANQYEVRINSEG RRELLDHSGEWINKEESIIKDISSKEYISFNPKENKITVKSKNLPELSTLLQ EIRNNSNSSDIELEEKVMLTECEINVISNIDTQIVEERIEEAKNLTSDSINYI KDEFKLIESISDALCDLKQQNELEDSHFISFEDISETDEGFSIRFINKETGES IFVETEKTIFSEYANHITEEISKIKGTIFDTVNGKLVKKVNLDTTHEVNTL NAAFFIQSLIEYNSSKESLSNLSVAMKVQVYAQLFSTGLNTITDAAKVVE LVSTALDETIDLLPTLSEGLPIIATIIDGVSLGAAIKELSETSDPLLRQEIEA KIGIMAVNLTTATTAIITSSLGIASGFSILLVPLAGISAGIPSLVNNELVLRD KATKVVDYFKHVSLVETEGVFTLLDDKIMMPQDDLVISEIDFNNNSIVL GKCEIWRMEGGSGHTVTDDIDHFFSAPSITYREPHLSIYDVLEVQKEELD LSKDLMVLPNAPNRVFAWETGWTPGLRSLENDGTKLLDRIRDNYEGEF YWRYFAFIADALITTLKPRYEDTNIRINLDSNTRSFIVPIITTEYIREKLSYS FYGSGGTYALSLSQYNMGINIELSESDVWIIDVDNVVRDVTIESDKIKKG DLIEGILSTLSIEENKIILNSHEINFSGEVNGSNGFVSLTFSILEGINAIIEVD LLSKSYKLLISGELKILMLNSNHIQQKIDYIGFNSELQKNIPYSFVDSEGK ENGFINGSTKEGLFVSELPDVVLISKVYMDDSKPSFGYYSNNLKDVKVIT KDNVNILTGYYLKDDIKISLSLTLQDEKTIKLNSVHLDESGVAEILKFMN RKGNTNTSDSLMSFLESMNIKSIFVNFLQSNIKFILDANFIISGTTSIGQFEF ICDENDNIQPYFIKFNTLETNYTLYVGNRQNMIVEPNYDLDDSGDISSTVI NFSQKYLYGIDSCVNKVVISPNIYTDEINITPVYETNNTYPEVIVLDANYI NEKINVNINDLSIRYVWSNDGNDFILMSTSEENKVSQVKIRFVNVFKDKT LANKLSFNFSDKQDVPVSEIILSFTPSYYEDGLIGYDLGLVSLYNEKFYIN NFGMMVSGLIYINDSLYYFKPPVNNLITGFVTVGDDKYYFNPINGGAASI GETIIDDKNYYFNQSGVLQTGVFSTEDGFKYFAPANTLDENLEGEAIDFT GKLIIDENIYYFDDNYRGAVEWKELDGEMHYFSPETGKAFKGLNQIGDY KYYFNSDGVMQKGFVSINDNKHYFDDSGVMKVGYTEIDGKHFYFAEN GEMQIGVFNTEDGFKYFAHHNEDLGNEEGEEISYSGILNFNNKIYYFDDS FTAVVGWKDLEDGSKYYFDEDTAEAYIGLSLINDGQYYFNDDGIMQVG FVTINDKVFYFSDSGIIESGVQNIDDNYFYIDDNGIVQIGVFDTSDGYKYF APANTVNDNIYGQAVEYSGLVRVGEDVYYFGETYTIETGWIYDMENES DKYYFNPETKKACKGINLIDDIKYYFDEKGIMRTGLISFENNNYYFNENG EMQFGYINIEDKMFYFGEDGVMQIGVFNTPDGFKYFAHQNTLDENFEG ESINYTGWLDLDEKRYYFTDEYIAATGSVIIDGEEYYFDPDTAQLVISE Forward 9 caccACTAGTatgaacttagtaactggatggc primer Reverse 10 CTCGAGttagccatatatcccaggggc primer Forward 11 caccATGCATatgagtttagttaatagaaaacag primer Reverse 12 ggcCTCGAGctattcactaatcactaattgagc primer Forward 13 AGATCTATGCATGAGCTCctcgagcccaaaacgaaaggctcagc primer Reverse 14 cggtccggggccatatatcccaggggcttttactcc primer Forward 15 caccCCATTGatggtaacaggagtatttaaagga primer Reverse 16 CTCGAGctattcactaatcactaattgagctg primer C-TADCTB 17 atggtaacaggagtatttaaaggacctaatggatttgagtattttgcacctgctaatactcacaataataacatagaag (nucleic gtcaggctatagtttaccagaacaaattcttaactttgaatggcaaaaaatattattttgataatgactcaaaagcagtta acid ctggatggcaaaccattgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaactat sequence) tgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaactattgatggtaaaaaatatt actttaatactaacactttcatagcctcaactggttatacaagtattaatggtaaacatttttattttaatactgatggtatta tgcagataggagtgtttaaaggacctaatggatttgaatactttgcacctgctaatacggatgctaacaacatagaag gtcaagctatactttaccaaaataaattcttaactttgaatggtaaaaaatattactttggtagtgactcaaaagcagtta ccggactgcgaactattgatggtaaaaaatattactttaatactaacactgctgttgcagttactggatggcaaactatt aatggtaaaaaatactactttaatactaacacttctatagcttcaactggttatacaattattagtggtaaacatttttatttt aatactgatggtattatgcagataggagtgtttaaaggacctgatggatttgaatactttgcacctgctaatacagatg ctaacaatatagaaggtcaagctatacgttatcaaaatagattcctatatttacatgacaatatatattattttggtaataat tcaaaagcggctactggttgggtaactattgatggtaatagatattacttcgagcctaatacagctatgggtgcgaat ggttataaaactattgataataaaaatttttactttagaaatggtttacctcagataggagtgtttaaagggtctaatggat ttgaatactttgcacctgctaatacggatgctaacaatatagaaggtcaagctatacgttatcaaaatagattcctacat ttacttggaaaaatatattactttggtaataattcaaaagcagttactggatggcaaactattaatggtaaagtatattact ttatgcctgatactgctatggctgcagctggtggacttttcgagattgatggtgttatatatttctttggtgttgatggagt aaaagcccctgggatatatggcAGATCTATGCATaatttgataactggatttgtgactgtaggcgatgata aatactactttaatccaattaatggtggagctgcttcaattggagagacaataattgatgacaaaaattattatttcaacc aaagtggagtgttacaaacaggtgtatttagtacagaagatggatttaaatattttgccccagctaatacacttgatga aaacctagaaggagaagcaattgattttactggaaaattaattattgacgaaaatatttattattttgatgataattataga ggagctgtagaatggaaagaattagatggtgaaatgcactattttagcccagaaacaggtaaagcttttaaaggtct aaatcaaataggtgattataaatactatttcaattctgatggagttatgcaaaaaggatttgttagtataaatgataataa acactattttgatgattctggtgttatgaaagtaggttacactgaaatagatggcaagcatttctactttgctgaaaacg gagaaatgcaaataggagtatttaatacagaagatggatttaaatattttgctcatcataatgaagatttaggaaatga agaaggtgaagaaatctcatattctggtatattaaatttcaataataaaatttactattttgatgattcatttacagctgtag ttggatggaaagatttagaggatggttcaaagtattattttgatgaagatacagcagaagcatatataggtttgtcatta ataaatgatggtcaatattattttaatgatgatggaattatgcaagttggatttgtcactataaatgataaagtcttctactt ctctgactctggaattatagaatctggagtacaaaacatagatgacaattatttctatatagatgataatggtatagttca aattggtgtatttgatacttcagatggatataaatattttgcacctgctaatactgtaaatgataatatttacggacaagca gttgaatatagtggtttagttagagttggggaagatgtatattattttggagaaacatatacaattgagactggatggat atatgatatggaaaatgaaagtgataaatattatttcaatccagaaactaaaaaagcatgcaaaggtattaatttaattg atgatataaaatattattttgatgagaagggcataatgagaacgggtcttatatcatttgaaaataataattattactttaa tgagaatggtgaaatgcaatttggttatataaatatagaagataagatgttctattttggtgaagatggtgtcatgcaga ttggagtatttaatacaccagatggatttaaatactttgcacatcaaaatactttggatgagaattttgagggagaatca ataaactatactggttggttagatttagatgaaaagagatattattttacagatgaatatattgcagcaactggttcagtt attattgatggtgaggagtattattttgatcctgatacagctcaattagtgattagtgaaCTCGAGggattaatatat attaatgattcattatattattttaaaccaccagtaaataatttgataactggatttgtgactgtaggcgatgataaatacta ctttaatccaattaatggtggagctgcttcaattggagagacaataattgatgacaaaaattattatttcaaccaaagtg gagtgttacaaacaggtgtatttagtacagaagatggatttaaatattttgccccagctaatacacttgatgaaaaccta gaaggagaagcaattgattttactggaaaattaattattgacgaaaatatttattattttgatgataattatagaggagct gtagaatggaaagaattagatggtgaaatgcactattttagcccagaaacaggtaaagcttttaaaggtctaaatcaa ataggtgattataaatactatttcaattctgatggagttatgcaaaaaggatttgttagtataaatgataataaacactatt ttgatgattctggtgttatgaaagtaggttacactgaaatagatggcaagcatttctactttgctgaaaacggagaaat gcaaataggagtatttaatacagaagatggatttaaatattttgctcatcataatgaagatttaggaaatgaagaaggt gaagaaatctcatattctggtatattaaatttcaataataaaatttactattttgatgattcatttacagctgtagttggatgg aaagatttagaggatggttcaaagtattattttgatgaagatacagcagaagcatatataggtttgtcattaataaatga tggtcaatattattttaatgatgatggaattatgcaagttggatttgtcactataaatgataaagtcttctacttctctgact ctggaattatagaatctggagtacaaaacatagatgacaattatttctatatagatgataatggtatagttcaaattggtg tatttgatacttcagatggatataaatattttgcacctgctaatactgtaaatgataatatttacggacaagcagttgaata tagtggtttagttagagttggggaagatgtatattattttggagaaacatatacaattgagactggatggatatatgatat ggaaaatgaaagtgataaatattatttcaatccagaaactaaaaaagcatgcaaaggtattaatttaattgatgatataa aatattattttgatgagaagggcataatgagaacgggtcttatatcatttgaaaataataattattactttaatgagaatg gtgaaatgcaatttggttatataaatatagaagataagatgttctattttggtgaagatggtgtcatgcagattggagtat ttaatacaccagatggatttaaatactttgcacatcaaaatactttggatgagaattttgagggagaatcaataaactat actggttggttagatttagatgaaaagagatattattttacagatgaatatattgcagcaactggttcagttattattgatg gtgaggagtattattttgatcctgatacagctcaattagtgattagtgaatag C-TADCTB 18 MVTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDN (amino DSKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEAA acid TGWQTIDGKKYYFNTNTFIASTGYTSINGKHFYFNTDGIMQIGVFKGPN sequence) GFEYFAPANTDANNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRTI DGKKYYFNTNTAVAVTGWQTINGKKYYFNTNTSIASTGYTIISGKHFYF NTDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHDNIY YFGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLP QIGVFKGSNGFEYFAPANTDANNIEGQAIRYQNRFLHLLGKIYYFGNNS KAVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGI YGRSMHNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGV LQTGVFSTEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYR GAVEWKELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFV SINDNKHYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKY FAHHNEDLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWKDLEDGSK YYFDEDTAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGII ESGVQNIDDNYFYIDDNGIVQIGVFDTSDGYKYFAPANTVNDNIYGQAV EYSGLVRVGEDVYYFGETYTIETGWIYDMENESDKYYFNPETKKACKGI NLIDDIKYYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYF GEDGVMQIGVFNTPDGFKYFAHQNTLDENFEGESINYTGWLDLDEKRY YFTDEYIAATGSVIIDGEEYYFDPDTAQLVISELEGLIYINDSLYYFKPPV NNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGVLQTGVF STEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYRGAVEW KELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFVSINDNK HYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKYFAHHNE DLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWKDLEDGSKYYFDED TAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGIIESGVQNI DDNYFYIDDNGIVQIGVFDTSDGYKYFAPANTVNDNIYGQAVEYSGLVR VGEDVYYFGETYTIETGWIYDMENESDKYYFNPETIU(ACKGINLIDDIK YYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYFGEDGV MQIGVFNTPDGFKYFAHQNTLDENFEGESINYTGWLDLDEKRYYFTDE YIAATGSVIIDGEEYYFDPDTAQLVISE C-TANCTB 19 atggtaacaggagtatttaaaggacctaatggatttgagtattttgcacctgctaatactcacaataataacatagaag (nucleic gtcaggctatagtttaccagaacaaattcttaactttgaatggcaaaaaatattattttgataatgactcaaaagcagtta acid ctggatggcaaaccattgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaactat sequence) tgatggtaaaaaatattactttaatcttaacactgctgaagcagctactggatggcaaactattgatggtaaaaaatatt actttaatactaacactttcatagcctcaactggttatacaagtattaatggtaaacatttttattttaatactgatggtatta tgcagataggagtgtttaaaggacctaatggatttgaatactttgcacctgctaatacggatgctaacaacatagaag gtcaagctatactttaccaaaataaattcttaactttgaatggtaaaaaatattactttggtagtgactcaaaagcagtta ccggactgcgaactattgatggtaaaaaatattactttaatactaacactgctgttgcagttactggatggcaaactatt aatggtaaaaaatactactttaatactaacacttctatagcttcaactggttatacaattattagtggtaaacatttttatttt aatactgatggtattatgcagataggagtgtttaaaggacctgatggatttgaatactttgcacctgctaatacagatg ctaacaatatagaaggtcaagctatacgttatcaaaatagattcctatatttacatgacaatatatattattttggtaataat tcaaaagcggctactggttgggtaactattgatggtaatagatattacttcgagcctaatacagctatgggtgcgaat ggttataaaactattgataataaaaatttttactttagaaatggtttacctcagataggagtgtttaaagggtctaatggat ttgaatactttgcacctgctaatacggatgctaacaatatagaaggtcaagctatacgttatcaaaatagattcctacat ttacttggaaaaatatattactttggtaataattcaaaagcagttactggatggcaaactattaatggtaaagtatattact ttatgcctgatactgctatggctgcagctggtggacttttcgagattgatggtgttatatatttctttggtgttgatggagt aaaagcccctgggatatatggcAGATCTATGCATaatttgataactggatttgtgactgtaggcgatgata aatactactttaatccaattaatggtggagctgcttcaattggagagacaataattgatgacaaaaattattatttcaacc aaagtggagtgttacaaacaggtgtatttagtacagaagatggatttaaatattttgccccagctaatacacttgatga aaacctagaaggagaagcaattgattttactggaaaattaattattgacgaaaatatttattattttgatgataattataga ggagctgtagaatggaaagaattagatggtgaaatgcactattttagcccagaaacaggtaaagcttttaaaggtct aaatcaaataggtgattataaatactatttcaattctgatggagttatgcaaaaaggatttgttagtataaatgataataa acactattttgatgattctggtgttatgaaagtaggttacactgaaatagatggcaagcatttctactttgctgaaaacg gagaaatgcaaataggagtatttaatacagaagatggatttaaatattttgctcatcataatgaagatttaggaaatga agaaggtgaagaaatctcatattctggtatattaaatttcaataataaaatttactattttgatgattcatttacagctgtag ttggatggaaagatttagaggatggttcaaagtattattttgatgaagatacagcagaagcatatataggtttgtcatta ataaatgatggtcaatattattttaatgatgatggaattatgcaagttggatttgtcactataaatgataaagtcttctactt ctctgactctggaattatagaatctggagtacaaaacatagatgacaattatttctatatagatgataatggtatagttca aattggtgtatttgatacttcagatggatataaatattttgcacctgctaatactgtaaatgataatatttacggacaagca gttgaatatagtggtttagttagagttggggaagatgtatattattttggagaaacatatacaattgagactggatggat atatgatatggaaaatgaaagtgataaatattatttcaatccagaaactaaaaaagcatgcaaaggtattaatttaattg atgatataaaatattattttgatgagaagggcataatgagaacgggtcttatatcatttgaaaataataattattactttaa tgagaatggtgaaatgcaatttggttatataaatatagaagataagatgttctattttggtgaagatggtgtcatgcaga ttggagtatttaatacaccagatggatttaaatactttgcacatcaaaatactttggatgagaattttgagggagaatca ataaactatactggttggttagatttagatgaaaagagatattattttacagatgaatatattgcagcaactggttcagtt attattgatggtgaggagtattattttgatcctgatacagctcaattagtgattagtgaaCTCGAGggattaatatat attaatgattcattatattattttaaaccaccagtaaataatttgataactggatttgtgactgtaggcgatgataaatacta ctttaatccaattaatggtggagctgcttcaattggagagacaataattgatgacaaaaattattatttcaaccaaagtg gagtgttacaaacaggtgtatttagtacagaagatggatttaaatattttgccccagctaatacacttgatgaaaaccta gaaggagaagcaattgattttactggaaaattaattattgacgaaaatatttattattttgatgataattatagaggagct gtagaatggaaagaattagatggtgaaatgcactattttagcccagaaacaggtaaagcttttaaaggtctaaatcaa ataggtgattataaatactatttcaattctgatggagttatgcaaaaaggatttgttagtataaatgataataaacactatt ttgatgattctggtgttatgaaagtaggttacactgaaatagatggcaagcatttctactttgctgaaaacggagaaat gcaaataggagtatttaatacagaagatggatttaaatattttgctcatcataatgaagatttaggaaatgaagaaggt gaagaaatctcatattct C-TANCTB 20 MVTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDN (amino DSKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEAA acid TGWQTIDGKKYYFNTNTFIASTGYTSINGKHFYFNTDGIMQIGVFKGPN sequence) GFEYFAPANTDANNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRTI DGKKYYFNTNTAVAVTGWQTINGKKYYFNTNTSIASTGYTIISGKHFYF NTDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHDNIY YFGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLP QIGVFKGSNGFEYFAPANTDANNIEGQAIRYQNRFLHLLGKIYYFGNNS KAVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGI YGRSMHNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGV LQTGVFSTEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYR GAVEWKELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFV SINDNKHYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKY FAHHNEDLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWKDLEDGSK YYFDEDTAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGII ESGVQNIDDNYFYIDDNGIVQIGVFDTSDGYKYFAPANTVNDNIYGQAV EYSGLVRVGEDVYYFGETYTIETGWIYDMENESDKYYFNPETKKACKGI NLIDDIKYYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYF GEDGVMQIGVFNTPDGFKYFAHQNTLDENFEGESINYTGWLDLDEKRY YFTDEYIAATGSVIIDGEEYYFDPDTAQLVISELEGLIYINDSLYYFKPPV NNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGVLQTGVF STEDGFKYFAPANTLDENLEGEAIDFTGKLIIDENIYYFDDNYRGAVEW KELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFVSINDNK HYFDDSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKYFAHHNE DLGNEEGEEISYS Preferred Aspects: Preferred Polypeptides and Uses Thereof:

1. An isolated polypeptide comprising an amino acid sequence having at least 85%, more preferably at least 90%, even more preferably at least 95%, most preferred 99% sequence identity to the amino acid sequence as set forth in SEQ ID NO: 2.

2. An isolated polypeptide comprising an amino acid sequence having at least 85%, more preferably at least 90%, even more preferably at least 95%, most preferred 99% sequence identity to the amino acid sequence as set forth in SEQ ID NO: 4.

3. The isolated polypeptide of aspect 1 or 2, wherein the polypeptide comprises 19 repeating units derived from the C-terminal domain of toxin A of Clostridium difficile and 23 repeating units derived from the C-terminal domain of toxin B of Clostridium difficile.

4. The isolated polypeptide of aspect 1, wherein the polypeptide has the amino acid sequence as set forth in SEQ ID NO: 2.

5. The isolated polypeptide of aspect 1, wherein the polypeptide has the amino acid sequence as set forth in SEQ ID NO: 4.

6. A polypeptides comprising an amino acid sequence having at least 85%, more preferably at least 90%, even more preferably at least 95%, most preferred 99% sequence identity to the amino acid sequence as set forth in SEQ ID NO: 4.

7. The polypeptide of aspect 6, wherein a hamster vaccinated with said isolated polypeptide survives intragastric administration of a lethal dose of C. difficile spores at all spore doses (10², 10³ and 10⁴).

8. The polypeptide of aspect 6 or 7, wherein the polypeptide comprises 19 repeating units derived from the C-terminal domain of toxin A of Clostridium difficile.

9. The polypeptide of any one of aspects 6 to 8, wherein the polypeptide comprises 23, 33 or 47 repeating units derived from the C-terminal domain of toxin B of Clostridium difficile.

10. The polypeptide of any one of aspects 6 to 9, wherein the polypeptide is selected from the group consisting of SEQ ID: 2, SEQ ID NO: 4, SEQ ID NO. 18, SEQ ID NO: 20 and a polypeptide that is 95%, 96%, 97%, 98%, 99% identical to any of SEQ ID: 2, SEQ ID NO: 4, SEQ ID NO. 18, or SEQ ID NO: 20.

11. The polypeptide of any one of aspects 6 to 10, wherein the polypeptide is isolated.

12. The polypeptide of any one of aspects 6 to 11 for use in medicine.

13. The polypeptide of any one of aspects 6 to 11 for the prevention and treatment of CDAD.

14. The polypeptide of any one of aspects 6 to 11 for the prevention of CDAD in a subject at risk of a CDAD.

15. The polypeptide of any one of aspects 6 to 11 for the prevention of CDAD in a subject at risk of a CDAD, wherein said subject at risk of CDAD is: i) a subject above 65 years of age or a subject below 2 years of age; ii) a subject with AIDS; iii) a subject taking or planning to take immunosuppressing drugs; iv) a subject with planned hospitalization or a subject that is in hospital; v) a subject in or expected to go to an intensive care unit; vi) a subject that is undergoing or is planning to undergo gastrointestinal surgery; vii) a subject that is in or planning to go to a long-term care such as a nursing home; viii) a subject with co-morbidities requiring frequent and/or prolonged antibiotic use; or ix) a subject with recurrent CDAD.

16. The use of the polypeptide any one of aspects 6 to 11 for the manufacture of a medicament for use in medicine.

17. The use of the polypeptide any one of aspects 6 to 11 for the manufacture of a medicament for the prevention and treatment of CDAD.

18. The use of the polypeptide any one of aspects 6 to 11 for the manufacture of a medicament for the prevention of CDAD in a subject at risk of a CDAD.

19. The use of the polypeptide any one of aspects 6 to 11 for the manufacture of a medicament for the prevention of CDAD in a subject at risk of a CDAD, wherein said subject at risk of CDAD is: i) a subject above 65 years of age or a subject below 2 years of age; ii) a subject with AIDS; iii) a subject taking or planning to take immunosuppressing drugs; iv) a subject with planned hospitalization or a subject that is in hospital; v) a subject in or expected to go to an intensive care unit; vi) a subject that is undergoing or is planning to undergo gastrointestinal surgery; vii) a subject that is in or planning to go to a long-term care such as a nursing home; viii) a subject with co-morbidities requiring frequent and/or prolonged antibiotic use; or ix) a subject with recurrent CDAD.

20. A diagnostic kit for detecting a C. difficile infection in a subject comprising the polypeptide of any one of aspects 1 to 11.

Preferred Nucleic Acids:

1a. A nucleic acid comprising a nucleotide sequence encoding any of the polypeptides of any one of aspects 1 to 11.

2a. The nucleic acid of aspect 1a essentially consisting of a nucleotide sequence encoding the polypeptide of any one of aspect 1 to 11.

3a. The nucleic acid of aspect 1a or 2a comprising a nucleotide sequence selected from the group of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 17 and SEQ ID NO: 19.

4a. The nucleic acid of aspect 1a or 2a essentially consisting of a nucleotide sequence selected from the group of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 17 and SEQ ID NO: 19.

Preferred Pharmaceutical Compositions:

1c. A pharmaceutical composition comprising the polypeptide of any one of aspects 1 to 11 or a nucleic acid of any one of aspects 1a to 4a and a pharmaceutically acceptable carrier or excipient.

2c. The pharmaceutical composition of aspect 1c, wherein said composition elicits antibodies neutralizing both C. difficile toxin A and B.

3c. The pharmaceutical composition of aspect 1c or aspect 2c, wherein said composition elicits protective immune response in a subject against C. difficile toxin A and B.

4c. The pharmaceutical composition of any one of aspects 1c to 3c, further comprising an adjuvant.

5c. The pharmaceutical composition of aspect 4c, wherein the adjuvant comprises alum.

6c. The pharmaceutical composition of any one of aspects 1c to 5c, further comprising an additional antigen or a drug.

Preferred antibodies: 1d. An antibody directed against a polypeptide of any of aspects 1 to 11, but not recognizing any of or both C. difficile toxin A (SEQ ID NO: 6) and B (SEQ ID NO: 8).

Preferred Methods

1e. A method for producing the polypeptide of any one of aspects 1 to 10 comprising introducing into a host cell a nucleic acid encoding the polypeptide, culturing the host cell under conditions that allow expression of the polypeptide, and isolating the polypeptide.

2e. The method of aspect 1e, wherein the host cell is E. coli.

3e. A method of treating and/or preventing C. difficile associated disease (CDAD) in a subject comprising administering to a subject in need thereof the isolated polypeptide of any one of aspects 1 to 11

4e. A method of inducing a specific immune response against both the toxin A and B of C. difficile in a subject comprising administering the polypeptide of any one of aspects 1 to 11 to a subject or the pharmaceutical composition of any one of aspects 1c to 6c

5e. A method of preventing a primary disease caused by C. difficile infection in a subject comprising administering the polypeptide of any one of aspects 1 to 11 to a subject or the pharmaceutical composition of any one of aspects 1c to 6c.

6e. A method of preventing a primary disease caused by C. difficile infection in a subject at risk of C. difficile associated disease (CDAD), wherein said subject at risk of CDAD is: i) a subject above 65 years of age or a subject below 2 years of age; ii) a subject with AIDS; iii) a subject taking or planning to take immunosuppressing drugs; iv) a subject with planned hospitalization or a subject that is in hospital; v) a subject in or expected to go to an intensive care unit; vi) a subject that is undergoing or is planning to undergo gastrointestinal surgery; vii) a subject that is in or planning to go to a long-term care such as a nursing home; viii) a subject with co-morbidities requiring frequent and/or prolonged antibiotic use; or ix) a subject with recurrent CDAD; comprising administering the polypeptide of any one of aspects 1 to 11 to said subject or the pharmaceutical composition of any one of aspects 1c to 6c. 7e. The method of any one of aspects 1e to 6e, wherein the polypeptide or the pharmaceutical composition is administered to the subject intramuscularly, intradermally, subcutaneously, orally, nasally, or rectally, preferably intramuscularly.

8e. The method of any one of aspects 1e to 7e, wherein the polypeptide or the pharmaceutical composition is administered to the subject within at least two doses in a short time interval (weekly or bi-weekly).

9e. A method of detecting C. difficile in a biological sample comprising contacting the biological sample with the polypeptide of any one of aspects 1 to 11 and detecting binding of the polypeptide to the biological sample, wherein binding of the polypeptide is indicative of the presence of C. difficile in the biological sample. 

The invention claimed is:
 1. An isolated polypeptide comprising an amino acid sequence having at least 85% sequence identity to the amino acid sequence as set forth in SEQ ID NO: 4, wherein the sequence identity is determined by Needleman-Wunsch global alignment.
 2. The polypeptide of claim 1, wherein the polypeptide comprises 19 repeating units derived from the C-terminal domain of toxin A of Clostridium difficile.
 3. The polypeptide of claim 1, wherein the polypeptide comprises 23 repeating units derived from the C-terminal domain of toxin B of Clostridium difficile.
 4. The polypeptide of claim 1, wherein the polypeptide consists of the amino acid sequence as set forth in SEQ ID NO: 2 or SEQ ID NO:
 4. 5. A pharmaceutical composition comprising the polypeptide of claim 1 and a pharmaceutically acceptable carrier or excipient.
 6. An immunogenic or vaccine composition comprising an isolated polypeptide having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO: 4 and a pharmaceutically acceptable carrier or excipient, wherein the sequence identity is determined by Needleman-Wunsch global alignment.
 7. The composition of claim 6, wherein the polypeptide comprises 19 repeating units derived from the C-terminal domain of toxin A of Clostridium difficile.
 8. The composition of claim 6, wherein the polypeptide comprises 23 repeating units derived from the C-terminal domain of toxin B of Clostridium difficile.
 9. The composition of claim 6, wherein the polypeptide consists of the amino acid sequence as set forth in SEQ ID NO: 2 or SEQ ID NO:
 4. 10. The composition of claim 6, wherein the composition further comprises a therapeutic agent and is for the prevention and treatment of a C. difficile associated disease (CDAD).
 11. The composition of claim 6, wherein the polypeptide has at least 99% sequence identity to the amino acid sequence as set forth in SEQ ID NO:
 4. 12. The composition of claim 6, further comprising an adjuvant.
 13. The composition of claim 12, wherein the adjuvant comprises alum.
 14. An isolated polypeptide having at least 99% sequence identity to the amino acid sequence as set forth in SEQ ID NO:
 4. 15. A pharmaceutical composition comprising the polypeptide of claim 14 and a pharmaceutically acceptable carrier or excipient. 